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(54) : Isolation of PCI and EISA masters by masking control and interrupt lines 



(57) A system management module (SMM) for a 
host server svstem includes a system management 
processor (SMP) connected to a system management 
local bus. The system management local bus connects 
to the system PCI bus through a system management 
central (SMC). The SMC includes the main arbitration 
unit for the PCI bus and also includes the arbiter for the 
system management local bus. The SMM includes a 
video controller and/or keyboard and mouse controller 
cmnected to the system management local bus to sup- 
port remote consoling of the SMM. The SMC Includes 



logic to monitor PCI cycles and to issue error signals in 
the event of a system error. The SMC also isolates failed 
components by masking request grant and Interrupt 
lines for the failed device. Further, if a spare component 
is provided, the SMC permits dynamic switching to the 
spare. In addition to detecting errors and performing sur- 
vival and maintenance operations, the SMC enhanc s 
system performance during normal operations by sup- 
porting rnaster-target priority determinatk)n8 to mor ef- 
ficiently arbitrate mastership of system busses such as 
the PCI bus. 
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D scriptlon 



Th present invention relates g n rally to a host computer that functions as a server for a disUibuted computing 
system More particularly, the present Invention relates to a diagnostic subsystem for the host computer that can be 

s directlyorremotelyaccessedbyasyst madministrator.andwhlchpermitsthesystemadministratorto ntercommands 
to the host system and to view operation of the host system. Still more particularly, the present invention r lates t a 
diagnostic subsystem for a host sewer computer that monitors the host server for errors, and in response, takes ap- 
propriate action, including alarm generation, error diagnosis, and in some instances, corrective action to restrict or 
minimize the error condition to pemiit continued operation of the distributed computing system. 

10 Distributed computing systems (or computer networks) are generally well known. Such systems pennit communi- 

cations between applicatton programs toaded on numerous computer work stations in a network. There are numerous 
types of distributed computing systems commonly classified by the geographtoal extent of their communication capa- 
bilities Some of the terms used to classify the geographical breadth of distributed computing systems are lor example 
kxal area networks (LAN's), metropolitan area networks (MAN's) and wkJe area networks (WAN's). 

IB Computer network systems such as LAN's have become one of the most important devices for storing and sharing 

data in a business. Thus, computer networks are one of the most critical pieces of equipment in a business office. A 
failure in the computer network can cause business operations to grind to a halt. Computer networks typically compnse 
a plurality of personal computers and other data processing devices connected together for infomnation exchange. At 
the heart of the computer network is one or more file servers. In most computer networks, file servers are responsible 

20 for administrating and storing the documents generated by each of the personal computers (PC's) in the system. In 
additkm to managing the network, file severe also preferably Include the capability to monitor for faults in the computer 
network. If a fault or security breach is detected, the file server provides a warning of the fault and h certain instances 
may also piovrtde diagnostic operattons and may even implement corrective measures. 

Because of the fact that file senrers are responsible for storing much of the data generated in the computer network. 

2S the file sewer typically has mass storage capabilities. The mass storage typtoally is implemented In a disk subsystem 
or disk array To presen/e the integrity of data stored in the disk array, it is common to provide redundant ar^ « 
inexpensive disk (RAID) protectton. to prevent loss of data in the event of sewer or network system errot lypkally. 
SCSI hard disks are used to peimil multiple hard drives to be chamed toa single Intertace connector 

Network irterface contrdler (NIC) cards cbrnmonly are used to couple the senrertoone or more computer networks. 

so BoththeSCSI hard disk and the network Interface controller (NIC) connect to slots or connectore in the sewer system 
board A sewer system board includes connectore for receiving external cards, such as the NIC cards. Other bus 
xpan'slon stole may also be provWed on the sewer system board, such as EISA (Extended Industrial Standard Archi- 
tecture) and PCI (Peripheral Component Interconnect) expanston sk>ts for receiving peripheral devfces designed for 
those particular bus standards. _. . j» .j 

ss - Files or data, are maintained by a host processing system within the sewer. Sewers are designed to provide work 
statnns with fast access to files stored by the sewer. Accordingly, file sewers embody a host computer whfch responds 
to an operating system program (a popular operating system being, for example. WINDOWS NT®, or NETWARE®) 
t not only orchestrate the files but also to maintain file security, file backup, or other file management features. On 
important aspect whteh ftows from maintaining host f uncttons within a sewer is the capability to manage the host from 
40 a remote site, and to even pennit management of the host from sites remote from the network. Recently there has 
b en a steady Increase in the number of sewere that are used in businesses. Increasingly, sewers are placed at each 
k)cation of a business, rather than using a centralized main frame computer at a single kicatkxi. Typically, a company 
has an Individual or department responsible for administering all of the file sewere. In many if not most instances, the 
administrator or administratton department Is physically kxated at a single site, such as the companies' headquarters 
Thus each of the sewere must either be maintained and administrated remotely or else personnel must be transported 
t remote offnes to permit on-site management. 

An example of a common sewer host system Is illustrated In Figure 1 . As shown generally in Figure 1 . one or mor 
host processors are coupled to a system bus. The host memory couples to the system bus via a memoiy «>n*r<^' [ 
unit The system bus. in turn, also couples to a PCI bridge which interfaces the system bus to a PCI bus. Vbnous PCI 
peripherals may reside on the PCI bus. A PCI to EISA bus bridge typically couples the PCI bus to an EISA bus. Nferious 
EISA peripherals may couple to the EISA bus. in additton. and as shown in Figure 1. an APIC bus may couple directly 
between the host system processor and the PCI to EISA bus bridge. Other per^pherais may also couple to the syst m 
bus. Netvyorit interface cards (NICs). for connecting the sewer to the PC's, may couple to either the PCI bus or to the 

ss Many operating systems permit access to the host from a remote site, often call a -virtual temiinal." A virtual 

tenninal. whil not physically connected t the host, non theless allows remote control of c rtain op rations of the 
host Products such as Compaq Sewer Manag r® and Compaq Insight Manag r®,obtainabl from Compaq Computer 
Corp have attempted to address some of th Issu s involved in managing a network of distribut d s w re from a 
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single, remote site. These products permit an administrator t be notified of a remote sen/er failure, to reset the sender 
from.theremot sit . and to access c rtain information provided on the s n^ercons le. Compaq's Insight Manager<B> 
permits remote maintenance of th fil sen^ r as w II as local and remote notification of errors. In addition, Insight 
Manager® permits the file sen/er to be re-booted from a remote location or from any syst m on the n twork. Insight 
s ManagertS^als provid s control faciliti s including diagnostic capabilities to analyze the conditl nof th sender system 
configuration and to update system firmwar . Insight Manager® collects and monit rs sender data as w II as data from 
each client in the network and alk>ws the netw rk manag r to act on the data from a remote locatton or any work statkm 
on the network. In addition. Insight ManagettB) includes the capability to set user defined thresholds which permit the 
server to monitor system parameters and to alert the network manager when an error occurs. Notificatkm in the event 

^0 of an alert or a failure is delivered in many possible ways including on screen messages, a pager, e-mail, fax and SNMR 
it is certainty beneficial to allow remote control of certain sender functions, especially those needed to reset one 
r more servers within network of servers. Downtime caused by sender failure is probably the most costly time involved 
in running a distributed computer system. The causes of server failure, often termed sender host 'crash" are numerous. 
Any number of malfunctions or design flaws associated with the server hardware, server operating system or applbatton 

IS program running on a sender may account for a server crash. If a server crashes, then file access is often lost and 
business records are temporarily inaccessible until the cause of failure is fixed. 

A true benefit would result if an administrator located remote from the server can do more than be alerted to, and 
then reset, a failed senrer. In particular, it would be advantageous for the administrator to determine the cause of sender 
failure so that he/she can possibly prevent future failures before they occur. Prevention of failure Is as important, if not 

^ more important, than resetting a server that has crashed. 

The cause of a failure is generally displayed on the server console at the time in which the server crashes. Moreover; 
irregularities in the server host hardware or operating system software can be detected upon reset (or *boot*). Those 
irregularities can lead to future failure If not attended to by the administrator. Accordingly, it wouM be beneficial to gain 
access to what is displayed on the server host console not only during server reset (or failure) but also leading up t 
s rver reset/failure. lnformatk)n within the video screens (rhore partKularly the sequence of vkJeo screens) displayed 
on the sender console, whk:h occur duroig sen/er failure or reset would help remotely bcated administrators determin 
(and hopefully fbc) an existing sen/er failure or potential failure. 

The vkieo screens, resulting from a reset or failure of the server, comprise a sequence of vkleo screen changes 
displayed on the host server console by the operating system, system basic input output system ("BIOS"), server 

30 application program or other system software. In particular, capture of two screen change sequences are of particular 
interest to a sender administrator. In order to fbc an existing failure or a future failure, it would be beneficial that the 
administrator be given the sequence of screen changes prk>r to server failure as well as the sequence of screen changes 
f Itowing a reset. Examples of sender failure screens displayed on the sender console are Microsoft Corp., Windows 
NT® 'blue screen" and Novell Corp.. NETWARE® ABEND message which appear on the sender console when the 

55 respective operating system crashes. These screerrs provkle informatkm such as processor fault Indicia; system soft- 
ware routine addresses, and pertinent system memory contents. Upon reset of the server, the power on self test 
('POST*) code, associated with the aforementioned operating systems, typically perfonrns some system diagnostk: 
functbns and displays information regarding failures detected to the server console screen. It would be desirable t 
capture such sequences and replaying them at a remote management site. It wouki also be desirable to have a remote 

^ console system which would permit diagnostk: inquiries to be developed remotely whk:h then could be transmitted to 
the host server system for execution. Thus, for e^^mple, tests can be run to detect open and short circuits in the system, 
which are the leading causes of failures. The results of these tests would then be relayed at the remote management 
site. 

In additkMi to remotely monitoring the host server system, it would be advantageous to design a host sender which 
^ is capable of sury'iving the majority of system faults. In that vein, it would also be advantageous if failed components 
could be detected and isolated from further interactbn in system operation. It would further be advantageous if failed 
components in the system could be replaced dynamically with spare components without requiring a system shut down. 

In systems with multiple bus masters, it is common to a ward mastership of the bus based upon priority valu s 
assigned to each master. Thus, when two masters request bus mastership at the same time, the master with the higher 
priority value will be granted mastership. Awarding mastership based only upon the preordained priority value of the 
bus master may cause the system to operate inefficiently- In addition, awarding mastership in this fashk)n ignores the 
type of transaction being executed, and also ignores the relative importance of the targets. It would be advantageous 
if additbnal criteria were examined for awarding bus mastership other than simply the relative priority of the masters 
to enhance system periormance. 

The problems outlined above are in large part solved by a remote communicatk>n system of the present invention. 
Th r mote communication system includes a syst m management module (SMM) which can be included in the s rver 
system (such as, for xampi , by connection to an xpansk>n bus on the s n^er system board). Pref rably. th SMM 
is conn ct d to the PCI bus, although th SMM may be kx:ated elsewh r if desired. The SMM includes a system 
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management processor (SMP) and a system managem nt c ntral (SMC) control logic. The SMP and SMC connect 
to a system management (SM) local bus, which also connects to a VGA controller and a keyboard interfac controller. 
The inclusion of a VGA controller n the SM local bus permits an administrator or other user t remotely monitor video 
data reflecting th operation of the SMM. Similarly, the inclusion of the keyboard interface on the SM local bus permits 

s the system administrator r other user to directly access the components on th SMM (such as the SMP and SMC) 
by gen rating commands r motely for these components. The Implem ntatfon of the VGA controller and the keyboard 
interface as part of the SMM thus permits an administrator to gather information stored by the SMM and t command 
the SMM to perform certain operations. In additbn, the inclusbn of these components In the SMM permits a user to 
access the server in a virtual terminal mode from a remote location. 

10 The SMC of the present invention preferably includes a bus arbitration and monitor unit that preferably operates 

as the main arbitration unit for the PCI bus. Thus, all requests for mastership of the PCI bus pass through the SMM. 
The bus arbitration and monitor togic receives bus mastership requests from the various bus masters in the system 
and grants mastership based upon certain criteria. In additk)n, the arbitration and monitor logk: monitors the activities 
of the bus master to determine if the bus master responds adequately to mastership grants. In the absence of a proper 

IS response, the arbitration and monitor logic will again attempt to grant bus mastership to the designated bus master. If 
the master does not perform successfully within a certain number of retries, the bus arbitration and monitor togic isolates 
that master from the system by masking the request/grant line to the arbitratton unit and masking the intermpt lines to 
the system processor for that master. Thus, the defunct master cannot be granted nnastership of the bus, and cannot 
transmit interrupts to the host system. The master therefore is effectively removed from the system until the master is 

20 repaired, or until the system is reset. 

The SMG of the present invention also Includes an IEEE 1149.1 compatible bus controller. This controller is used 
to run Boundary Scan tests on the host system. The SMP controls the SMC bus controller 

The SMM of the present inventfon also includes the capability to reconfigure memory and inputAoutput space for 
dynamk^ally switching to spare components without shutting down the system. If a spare is provided for a system 

2S component, the SMM dynamk:ally reconfigures memory and I/O space in the event that the primary system component 
fails so that the spare assumes the address of the failed system component. In additkxi, the SMM functions to copy 
data from the failed component to the spare. During the conflguratfon and switching, the SMM locks the appropriat 
bus (such as the PCI bus) to prevent any traffic on the bus until the spare has been switched. 

The main arbitratton logic in the SMC preferably grants bus mastership based upcn the Master-Target combination. 

so Thus, certain cycles are given priority when awarding bus mastership. The arbitration and monitor togic keeps track 
of the identity of the current bus master. In additton, the arbitration and monitor logfc may be programmed to award 
priority based upon certain master-target combinations. Thus, the SMC artDitration unit receives bus mastership r - 
quests, and also monitors for the address of the intended target. Mastership grants are determined by both the source 
of the request and the intended target of the cycle. 

3S The SMM of the present Invention preferably includes system management remote (SMR) units whtoh function t 

monitor condittons in the host system. The SMR units couple to the SMC viia an IEEE 1149.1 standard test access 
bus, and one or more special serial interrupt busses. The SMC and SMR units transmit signals over a serial interrupt 
bus (either the Granite interrupt bus (GIBUS) and/or the inter-chip inten^upt bus (ICIB)) using time division multiplexing, 
in the event that the SMC cannot kJentify the source of an interrupt, the IEEE 1149.1 standard test access bus rnay 

40 be used to identify the source by communicating with the SMR's in the system. The SMC includes interrupt routing 
logic to mask certain interrupts in the event of a system failure. 

These and other advantages of the present Inventton will be apparent to one skilled in the art upon reading the 
foltowing detail description of the invention. 

A better understanding of the present invention can be obtained when the foltowing detailed descrlptton of the 

45 preferred embodiment is considered in conjunctton with the foltowing drawings, in whtoh: 

Figure 1 is a simplified btock diagram of a prior art host server system; 

Figure 2 is a btock diagram illustrating a host sender system constructed in accordance with the preferred embod- 
iment; 

so Figure 3 Is a block diagram deputing the configuration of the system management module (SMM) of Figure 2; 

Figure 4 is a block diagram depicting the configuration of the system management central (SMC) of Figure 3; 

Figure 5 is a functional block diagram showing the routing of interrupt signals in the host server system; 

Figure 6 is a flow chart illustrating the operatton of the SMC when dynamtoally switching to spare components in 

the event of a failed component; 
ss Figure 7 is a btock diagram showing in more detail the manner in which the SMM performs dynamic switching; 

Figure 8 is a flow chart depicting th operation of th arbitratton and monitor logic of Figure 4 for disabling faulty 

bus masters; 

Figure 9 is a flow chart illustrating the mast r-target based arbitratton p rform d by th arbitration and monitor 
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I gic f Figure 4; 

Figure 10 is a block diagram depicting the pref rred configuiati n of the system management remote (SMR) units 
f Figur 2. ' 

Figure 11 A depicts a prior art arbitration scheme using round robin logic; and 

Figure 11 B depicts an arbitration sch m implementing exemplary Master/Target priority logic. 

Referring now t Figure 2. a block diagram of an exemplary computer syst m 5 is shovwi. The computer system 
5 encompasses any system which stores and retrieves files, including a file senrer. an application sewer, or any other 
sender used in a distributed computing system. The computer system 5 preferably includes a host subsystem 25 which 
compnses a host central processing unit (CPU) 10 coupled to other components in the host subsystem 25 via a CPU 
local bus (or host bus) 35. The CPU 1 0 includes any data processing unit which implements a predetermined instruction 
set and has basic computing features such as executton units, controUliming units and registers. Exemplary CPU's 
include the Pentium® and Pentium Pro© processors manufactured by Intel Corp. As shown in Figure 2. multiple CPU's 
1 1 . 1 2, 1 3 may be provided to enhance the processing capabilities of the system 5. 

Various peripheral devices couple to the host CPU and the host bus by connection to one of a number of system 
penpheral busses. In accordance with one embodiment of the present inventton, the bus architecture of the computer 
system preferably includes a 32-bit PCI (peripheral component interconnect) bus 50 and 32-bit EISA (extended indus- 
trial standard architecture) bus 60. Alternatively, the bus 60 can comprise an ISA (industry standard architecture) bus. 
or any other bus architecture capable of interconnecting components in a computer system. In addition, while the biM 
60 IS shown coupled to the PCI bus. one skilled in the art virill recognize that other system configurations may be used 
to interface the various busses that are implemented in the system. 

The host subsystem 25 preferably couples to the PCI bus 50. The host processor subsystem 25 includes a proc- 
essor-PCI bridge 18 that connects between the PCI bus 50 and the host bus 35 to convert processor q^les to PCI 
cycles and vfce versa to facilitate data transfers between the host bus 35 and the PCI bus SO. In the preferred embod- 
iment, the processor-PCI brkige 18 also includes a memory controller for controlling operations to host memory 15 
The memory controller 1 8 orchestrates the transfer of data between the host bus 35 and host memory 1 5. The memory 
controller 18 responds to various addressing technkjues. and preferably is configured to support various storage cell 
architectures within system memoiy 15. including various dynamk: random access memory (DRAM) implementatnns 
and static random access memory (SRAM) conflgurattons. The processor-PCI bridge 18 is preferably implement d 
with an application specific integrated circuit (ASIC), but altemiatively can be implemented with the 82434LX PCI/ 
Cache/Memory Controller (PCMC) fiom Intel Corp®. Each of the CPU's in the host subsystem 25 preferably connect 
to an external cache 17. Control of the extemai caches 17 is provided by the processor-PCI bridge 18 or by togk: in 
the cache 17. 

Various peripheral devtees may connect to the PCI bus 50. as will be understood by one skilled in the art For 
xample, senrer systems preferably include a SCSI hard disk controller 45 connected to the PCI bus 50 to control 
operattons of various SCSI devices through a connector (not shown). The various devbes include haid disk drives and 
CD-ROM drives. Other expansion slots 47 may be provided for coupling other peripheral devwes to the PCI bus 50 

In the preferred embodiment, a system management module (SMM) 100 preferably also connects to the PCI bus 
50. The SMM 100 may be provided integrally with the computer system 5 on a system board, or may be provkJed as 
a plugin board for connecting to the PCI bus 50 in accordance with known techniques. The SMM 1 00 detects failures 
in the computer system 5 and preferably includes the capabUity to correct some of those failures. As to the failures 
that cannot be corrected, the SMM 100 preferably is capable of isolating the source of thefailure to enable the computer 
system to continue operatton. with limited capabiltties. In response to an error or failure, the SMM 100 preferably 
transmite an enor signal to a systems administrator to initiate error analysis followed in some instances by a syst m 
reset. In accordance with the prefen^ed embodiment, the error analysis and reset are perfonned remotely. 

The system management module 100 may monitor various electrical and mechank»l parameters in the system 
to detect system errors. Thus, for example, the SMM 100 monitors for noise in the power supply circuH, as well as for 
noise in other components in the computer system, detenmines if open and shorted connections exist in the computer 
system, checks for failure in the various memory devices, and detects failures in the CPU, bus bridges, hard disk 
controller, hard disk drives, and network interface cards (NIC's). The system management module 100 also monitors 
the environmental conditions of the computer system 5. such as temperature and fan operatton. 

Referring still to Figure 2. the SMM 100 preferably couples to one or more PCMCIA connectors 95. 96 to permit 
the SMM 100 to communicate to other components on the networi< (via a NIC card) or to remote kxattons (via a 
modem). The SMM 100 also preferably includes its own power source 97 in the event of a power failure to the system 
Thus, power source 97 preferably comprises a battery back-up, or other uninterruptible power supply. The SMM 100 
c uples to various Syst m Management R mote monitoring units (SMR's) 71, 72, 73 in th system pref rablyviathe 
IEEE 1149.1 standard test bus. and a bi-directional Granite int rrupt serial bus (Gl Bus) 70. In additton the SMM 100 
couples via a high speed bi-directk>nal s rial inter-chip int rrupt (ICIB) bus 65 to an EISA system management remote 
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(EISA SMR) unit 76. The SMM 100 also may connect to other buses in the syst m. including the APIC bus 80 and 
IEEE 1149.1 standard test bus 81. As one skilled in th art will appreciate, connections to some of these buses may 
be omitted. Conversely, in the event other buses are available in the system, the SMM 1 00 nnay connect to these buses 
and may transmit signals over these bus s to facilitate system diagnosis and error d tectlon. 

5 Preferably connected b twe n th PCI bus 60 and th EISA bus 60 is a PCI-EISA bus bridg 85. The PCI-EISA 

bus bridge 85 preferably comprises an Intel 82420/82430 PCI S t EISA Bridge that controls the transf r f data and 
address signals between the PCI bus 50 and the EISA bus 60. In accordance with nonmal convention, the c mputer 
system 5 includes slots 62 on the EISA bus 60 for receiving expansion boards. In one embodiment, a network Interface 
card (NIC) 61 connects to the EISA bus 60 for controlling communrcations with an external kDcal area network (not 

10 shown specifically). A data buffer 66 also connects to the data portk>n of the EISA bus 60 to provkle an additional data 
bus (the X bus 90) for varfous components. A system flash ROM 79 receives Its control and address signals from the 
EISA bus 60 and connects to the data portion of the X bus 90 for data transfers. Preferably, the system flash ROM 79 
contains the BIOS informatbn for the computer system 5. A floppy controller 86 also connects to the data portion of 
the X bus 90 for performing data transfers between a floppy disk drive (not shown) and other components in the 

16 computer system 5. In the preferred embodiment, a system management renrwte (SMR) unit 76 couples to the EISA 
bus 60 for monitoring conditions on the EISA bus 60 and transmitting signals to the SMM 100. As will bo described In 
more detail, the SMR unit 76 monitors the operation of the EISA bus 60 and transmits a signal to the SMM In the event 
of an error conditton on the EISA bus 60. SMR 76 also preferably relays interrupt signals appearing on the EISA bus 
60 to the SMM 1 00. SMR units 7 1 , 72 and 73 connect to various system togic and busses to provkSe monitoring functions 

20 and data relay to the SMC. Preferably, this logic includes the host-to-PCI bridge and the PCMo-EISA brkJge, and the 
host bus. Thus, as shown in Figure 2, SMR 71 couples to the host bus 35 to monitor operations on that bus. SMR 72 
connects electrically to the PCI-EISA bus brfclge 85 for monitoring operation of the bus bridge. Similariy, SMR 73 
connects to the host bridge 18 for monitoring the operation of the host bridge. In the event of an error on the bus or 
bridge being monitored, the SMR unit transmits a signal to the system managemisritmodule 100 via the bWireclional 

25 (QrorlCIB)bus. 

Referring now to Figure 3, the system management module 100 preferably includes a system management proc- 
essor (SMP) 1 50. whfch preferably is Implemented with the Intel 386EX processor. One skilled in the art will understand 
that other processors may also be used as the SMP 150. The SMP 150 preferably provkJes addilfonal intelligence to 
the host system 5 focused primarily upon error detection and system admlnlslratten to reduce the load on the host 

30 CPU 10. In addhton, according to the preferred embodiment, the SMP 150 also preferably functions as a "sun^ival" 
processor in the event the host GPU 10 "hangs" or is rendered inoperable by system error. The SMP 150 thus permits 
access to the host computer system 5 for purposes of accessing system components to determine the source of th 

rror. The SMP 1 50 also preferably detects other system errors and isolates the source of the errors to permit the host 
corhputer system 5 to continue operations. » ^ 

ss The SMP 1 50 preferably operates at a frequency of at least 25 MHz with a supply voltage of 5 volts. As one skill d 

in the art will understand, however, other ckx:k frequencies and power supply requirements may be used depending 
upon the processor selected as the SMR The SMP 150 preferably couples to a 16-blt )oca\ SM bus 75, whfch preferably 
is based primarily upon the protocol off the ISA (Industry Standard Architecture) bus. In accordance with the preferred 
mbodiment, bi-directional tri-state buffers (not specifically shown) are placed between the processor address and 

40 data sisals and the address and data signals SA and SD off the SM bus 75. These buffers also are placed on th 
IEEE 1149.1 bus and may also be placed on the IIC bus, or any other bus which connects togfc (including battery- 
backed togfc and other system logic). The tri-state buffers preferably are implemented with high speed bus CMOS 
switches such as the QST 3384 or 3584 from Quality Semfconductor Inc., although other high speed bus switches 
could be used. In the preferred embodiment, the tri-state buffers are embedded in the SMP 150 to reduce cost and 

45 board space requirements. In the embedded embodwnent. the tri-state buffers can either be a second stage of buffers 
or can comprise the existing tri-state buffers used during hold operations. The control signals df the SMP 150 connect 
directly to a system management central (SMC) ASIC 200. A serial port 1 33 and a parallel port 1 35 preferably conn ct 
directly to the SMP 150, whfch preferably includes integrated controllers for controlling communications with the senal 

arKi parallel ports. ^ w o^jo 

so The system management processor 150 preferably has the same privileges as the host CPU 10. Thus, the SMP 

160 can reboot the system 5 without accessing the host CPU 10. In the event that the SMP is removed from the PCI 
bus 50 or the PCI bus fails, the SMP 150 can still operate and communicate with the VGA controller 175. the keyboard 
controller 185, the SMM NIC 192 and the SMM modem 194. In the pretended embodiment, the system management 
module 100 preferably isolates and utilizes separate input, output and memory maps and interrupts. The SMC 200 
ss functions as a switch box for int rrupt routing and int rfacing the various busses. The SMC 200 routes keyboard and 
mous int rrupts individually to the system manag ment processor 150 and to the host CPU 10. 

Also connected t the SM bus 75 is a pair of PCMCIA slots 1 95, 1 96 for connection to PCMCIA cards, such as a 
network Interfac card (NIC) 192 or a modem 194. The Inclusion of the NIC 192 on the SM \oca\ bus 75 permits the 
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SMP. 150 to pert rm accesses to th distributed network system even if the comput rsyst m 5 becomes inoperable. 
Aaa result, if a failur occurs in the host CPU 10. the NIC controller 61 . PCI bus 50. and/or EISA bus 60. the SMM 
1Q0 still may perf rm access s to th n twork system through a SM NIC 192 connected to slot 196 on the SM kx:al 
bus 75. Similarly, th SMP 150 may send messages to a r mote locatk>n via the SM modem 194, which connects to 
s slot 1 95 on the SM local bus 75. Thus, the SMP 1 50 is capable f sending r mote signals in the event that the computer 
system 5 crashes. 

The SMM 1 00 also includes random access mem ly 1 1 0 preferably in the f mn of dynamic random access menrrary 
(DRAM). Other types of memory circuits also may be used such as static random access menrwry (SRAM) circuits, if ' 
desired. The local memory 110 preferably has a storage capacity of at least 2 Mbytes, with expansion capabilities to 

10 at least 4 Mbytes. The SM DRAM 110 is used to store data and code for execution by the SMP 150. 

A kx:al system management electrically erasable programmable read only memory (SM ROM) 160 preferably 
connects to the SMP 150 via control lines SC and connects to the SM kx^l bus 75 to transmit address (SA) and data 
(SD) signals when prompted by the SMP 150. The kx»t ROM 160 preferably stores power-on self-test (POST) code 
forthe system management module lOOas wellasfortherestofthecomputersystem.TheSMROM 160 also preferably 

15 stores test vectors for running an IEEE 1149.1 boundary scan test on the host system. The SMC contains an IEEE 
1 1 49.1 compatible bus controller for executing this test. 

v:. The SM ROM 160 preferably has a storage capacity of 4 Mbytes. The basic Input/output system (BIOS) for the 
SMM 100 also preferably is stored in the system management ROM 1 60. The SM BIOS assumes a 4 megabyte system 
management DRAM 110. During Initialization of the SMM 100, the SM ROM 160 writes certain patterns to the SM 

so DRAM 110, and performs reads and compares data to detenmine the actual size of the system management DRAM 
110. The memory nnapped devices on the system management module 110 include a vkdeo controller 175, system 
management DRAM 110, flash ROM 160, and PCMCIA devices 192. 194. VkJeo space size and addresses preferably 
are fixed, but system management DRAM 110, system management ROM 160, and PCMCIA sizes and addresses 
are programniable in SMC configuration registers. In additkxi, programmable page registers preferably are included 

2S In the SMC 200 to allow the SMP 150 to access the host memory 1 5 and the host I/O space. In an exemplary embod- 
iment^ each PCMCIA sk>t 1 95, 1 96 can be mapped into one of five memory address ranges above 64k and bek>w 16 
megabytes, starting at any 4k boundary. The SMC 200 preferably broadcasts SMP 1 50 memory cycles which have an 
< address outside of the programmable address regions to the PCI bus 50. The system management power on self test 
(POST) program enables the default broadcast feature after determining the SM DRAM 110 size by setting an appro- 

30 priate PCI definition register. Table I represents exemplary default addresses for various devices ki the system man- 
agement memory space: 



TABLE! 



Device 


Legend 


Size 


Address Space 


System Management Flash EEPROM 


SM^ROM 


8MB 


380_OOOOH - 3FF_FFFFh 


PCMCIA Socket A Mem Region 0 


MMR.AO 


Variable 


<+OFF_FFFFh 


PCMCIA Socket A Mem Region 1 


MMRLA1 


Variable 


<+OFF_FFFFh 


PCMCIA Socket A Mem Regton 2 


MMRJ\2 


Variable 


<+OFF„FFFFh 


PCMCIA Socket A Mem Region 3 


MMR_A3 


Variable 


<+OFF_FFFFh 


PCMCIA Socket A Mem Region 4 


MMR_A4 


Variable 


<+OFF_FFFFh 


PCMCIA Socket B Mem Region 0 


MMR^BO 


Variable 


<40FF_FFFFh 


PCMCIA Socket B Mem Region 1 


MMR^BI 


Variable 


<+OFF_FFFFh 


PCMCIA Socket B Mem Region 2 


MMR_B2 


Variable 


<40FF_FFFFh 


PCMCIA Socket B Mem Region 3 


MMR.B3 


Variable 


<+OFF_FFFFh 


PCMCIA Socket B Mem Regk)n 4 


MMR_B4 


Variable 


<+OFF_FFFFh 


System Management Memory 


SM.RAM 


1 MB 


010_000h-01F_FFFFh 


System Management ROM Shadow 


SM_ROM 


75 KB 


OOE^OOOOh- 00F.FFFFh 


(Can be PCMCIA Mem) 




75 KB 


OOC.OOOOh- OOD.FFFFh 


System Manag ment Vki 


SM^Vid o 


75 KB 


O0A_00O0h - OOB^FFFFh 


System Manageinent Base Memory 


SM.RAM 


640 KB 


000_0000h - 009_FFFFFh 
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Also coupled to th SM bus 75 is a video controller 1 75 and a k yboard and mouse interface controller 1 85. The 
video controller 175 and keyboard and mouse contr II r 1 85 preferably connect through bi-directional tri-state buffers 
173 to the SM bus 75. The address, data, and control portions that c nnect to the video controller 175 and k yboard 
and mouse controller 1 85 ar shown in Figure 3 as signal lines XSA, XSD and XSC: The SM local bus 75 connects to 

5 buffers 173 via address, data and c ntrol lines SA, SD and SC. Th nable Input of the tri-stat buffers 173 c nnects 
to the inverted state of a power good signal (PGOOD), which preferably may be ass rted by either the syst m power 
supply (not shown), or the batt . ry back-up 97 (Figur 2). The tri-state buff rs 1 73 preferably are provided betwe n the 
video controller 175 and the SM bus 75 to match the propagation delays associated with the address and data signals 
driven by the SMP 150 due to the tri-state buffers embedded in the SMP 150. 

to The keyboard/mouse Interface controller 1 85 preferably couples to the system management local bus 75 to provkie 

encoded input signals from the user to the various devices in the system management module 100. In similar fashion, 
the VGA controller 175 couples to the system management local bus 75 and to a display unit 170 where SMM and 
host system operations may be displayed to a local user. The keyboard controller permits the SMM 100 to simulate a 
keyboard, monitor accesses by the host, and take control of the keyboard from the host in appropriate circumstances. 

IS Simulatk>n of the keyboard Is performed by stuffing key scan codes into the keyboard controller 185 by the SMM 100. 
In the preferred embodiment, the keyboard controller 185 comprises an Intel part # 8042. During keyboard simulatnn. 
the host system preferably is locked out of accessing the keyboard controller 185 through the assertion of a ForceRetry 
signal to the SMC 200 by the SMP 150. In response, the SMC 200 denies other PCI masters access to the SM local 
bus 75. 

zo The inclusion of the keyboard controller as part of the SMM 100 permits the system management processor 150 

to view the input signals generated to the host system. To fully monitor keyboard operatk>ni the SMC 200 preferably 
receives interrupts from the keyboard, and monitors read and write cycles involving the kieyboard and mouse controller 
1 85. In the preferred embodiment, the system management processor (SMP) 150 receives the interrupt requests from 
the keyboard controller 185 for all operatkxis of the keyboard and mouse. In additkm, an interrupt is generated for the 

zs SMP 150 during writes to the keyboard controller 185. To allow the SMP 150 to view the data passing to the host 
system, the SMC 200 preferably maintains shadow registers of the keyboard controller 185. The status of the shadow 
r gisters is maintained during host operatkxi. Examples of the shadow registers are listed in Table II: 



TABLE II 



Addressibits 


Function 


Sub-functkxi 


xx60:7-0 


(ro) Shadow of 8042 60h when read by host 


Read clears the valid and over^run bits 


xx61:7-0 


(ro) Shadow of 8042 60h when written by host 


Read clears the valid and over-run bits 


xx62:7-0 


(ro) Shadow of 8042 64h when read by host 


Read dears the valid and over-run bits 


xx63:7-0 


(ro) Shadow ot 8042 64h when written by host 


Read clears the valid and over-run bits 


xx64:0{ro) 


(ro) Shadow status:8042 60h read vaiki 


1=register valid (accessed by host) 


xx64:1 


(ro) Shadow 8tatus:8042 60h write vaIki Read also 
clears SMP interrupt for write 


1 =:reglster valid (accessed by host) 


xx64:2 


(ro) Shadow status:8042 64h read valid 


1 ^register valid (accessed by host) 


xxb4:3 


(ro) Shadow 8tatu8:8042 64h write valkl Read also 
clears SMP interrupt for write 


1=register valid (accessed by host) 


xx64:4 


Shadow status:8042 60h read over-run 


1=rmore than one host access since last read 


xx64:5 


Shadow status:8042 60h write over-run 


1=more than one host access since last read 


xx64:6 


Shadow status:8042 64h read over*run 


1=more than one host access since last read 


XX64:7 


Shadow status:8042 64h write over-run 


l-nnore than one host access since last read 



The assertk)n of one of the interrupt lines by either the mouse or keyboard for more than 500 milliseconds without 
a status read of the 8042 controller by the host is interpreted by the SMC 200 as a host kx:k up requiring the SMP 1 50 
t take over control of the keyboard. The SMC 200 preferably includes a counter to measure the time between the 
assertbn of an inten-upt and a status read of the 8042 controller. When the SMP takes over operation of the keyboard, 
the SMC bkx:ks keyboard interrupts from being delivered to the host system. Once the keyboard tnt rrupts ar blocked 
to the host syst m, the system management proc ssor 150 can access the keyboard controll r 185 without host In- 
t rvention. 
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For th VGA remote console support, the SMP 1 50 pr ferably reads the state of the video controller 1 75 at regular 
inten^als f r the purpose of duplicating the vid o controller state. One problem which may occur is in reading th state 
of the VGA controller 1 75 when the host CPU 1 0 is making an acc ss to the video controller 1 75. Simultaneous accesses 
to the video contr Her 175 by the SMP 150 and the host CPU 10 presents a problem forth input/output registers used 

5 for remote switch cursor control and the like. To nsur safe access s t the VGA regist rs which form part of the 
vide controller 1 75, a lock signal is assert d by the SMP 150 to prevent the host from accessing the video controller 
175 during accesses by the SMP The signal is referr d to as the SMC.SLAVE^LOCK. The SMC 200 xecutes lock 
cycles on the PCI bus by running the PCI LOCK signal until the SMP 150 executes a no lock cycle. 

Referring still to Figure 3. the vkJeo controller 175 preferably is representative of commonly used video controllers, 

10 such as video graphics array (VGA) controllers. The vkJeo controller 175 has associated with it a vkJeo memory for 
storing pixel values to be displayed on display 170. The inclusion of the video controller 175 and keyboard and mouse 
controller 185 on the SM local bus 75 enables the SMM 100 to function in a remote terminal mode in which a remote 
user can access and/or control operation of the SMM 100. The remote user may reside on the network and connect 
to the SMM 100 through the SM NIC 192. or may couple to the SMM through the SM modem 194 from a location 

IS removed from the network. Thus, the remote user may generate, for example, keyboard and/or mouse signals that ar 
routed to the SM local bus 75 via SM NIC 192 or SM modem 194. The keyboard and mouse controller 185 receiv s 
the signals and encodes those signals for processing by the SMP 150. Similarly, video signals generated by the SMP 
(and if desired the host CPU 10) are provkJed to the VGA controller 175 via the SM local bus 75. The VGA controller 
175 processes the signals into a pixel format for displaying on a display unit, such as display 170. In addition, the pix I 

so signals may be transferred to a remote kjcatfon via the SM NIC 1 92 and/or SM modem 1 94 for viewing at a remot 
locatton using technques known In the art. In partfcular. the SMP 150 may cause pixel data to be written from vWeo 
memory to the remote tenminal via either the SM NIC 1 92 or the SM modem 1 94 while the host system is tocked from 
accessing the system management local bus 75. As a possible alternative, and assuming that the remote terminal may 
assert mastership of the SM kx:al bus 75, the remote tenminal may perform read cycles to the video controller 175 to 

2S read the pixel data stored In the video mennory. 

Thus, in accordance with the preferred embodiment, remote consolvig support can be performed through the use 
of the VGA controller 175 and keyboard controller 185, even in the event of a crash by the host CPU. a PCI bus hang, 
and/or a toss of power. The VGA controller 1 75 and keyboard controller 1 85 preferably provide remote console support 
f both the SMM 100 and the host system generally, based upon signals appearing on either the PCI or the syst m 

30 management local bus. 

The system managenrient local bus 75 preferably comprises an ISA style bus whfch is modified to permit the SMM 
100 to match VGA timing. Because the VGA controller 175 is located on the SM kx:al bus 75, the SMP 1 50 also can 
perform write cycles to the video controller 175 for screen save programs to cut down on PCI traffic and minimize use 
of the host CPU 10. thereby enhancing system performance. In the preferred embodiment, the SMM 100 has bus 
^ mastership capabilities on the PCI bus 50. thus permittnig the SMM 100 to run diagnostk: operatfons on the PCI bus 
50 and thus, to the rest of the system. In addition, because of the inclusbn of the video controller 150 and keyboard 
and mouse controller 185 as part of the SMM 100, commands for the SMM diagnostte operations may originate from 
a remote terminal. 

As will be apparent to one skilled in the art and as shown in Figure 3, a display unit 170 may be connected to the 

^ VGA controller 175 for locally viewing operatk>n of the SMM 100 and other operations of system 5. The display 170 
may include any console or monitor capable of receiving video data, such as a cathode ray tube (CRT), Ik^uki crystal 
display (LCD), thin film transistor (TFT), or any other suitable computer display Similarly, a keyboard 183 and mouse 
181 may couple to the keyboard and mouse controller 185 for provWing input signals from a k>cal administrator to the 
SMM 100, and to the computer system 5. generally Additk>nal input and output devices may also be coupled to an 

^ expansion bus (such as the EISA bus 60) if desired. 

In the preferred embodiment, and referring still to Figure 3, the SMM 1 00 or some other server system 5 component 
saves sequences of video screens changes and other system information just prior to server failure and just after sender 
reset. In the preferred embodiment, this Information is saved in memory resources provided in the SMM 100. such as 
SMC registers, the system management DRAM 110. and/or the System Management ROM 160. In the event of a 

so system failure, data stored in the system management memory devices may be written by the SMP 1 50 to the remote 
t mninal, or may be accessed by a k>cal administrator. Preferably, the remote terminal that couples to the SMM 100 
ither by nrKxJem 1 94 or NIC 1 92 includes a display apparatus and an input devfee. As will be understood by one skilled 
in the art, the remote terminal that performs the accesses to the SMM 100 may be k>aded with software for facilitating 
the remote access. Suitable software includes COMPAQ Insight Manager (CIM). 

ss Referring still to Figure 3. an expansion connector 1 62 nnay be provided on the SM te>cal bus 75 to permit additional 

p ripheraldevic stob connected to the SMM 100. The conn ctor 162 also may be us d to connect a diagnostic unit 
directly to the SM kx:al bus 75. 

Referring now to Figures 3 and 4, the system manag ment c ntral (SMC) 200 couples to the SM local bus 75 and 
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to th PCI bus 50. In addition, th SMC 200 also connects to various other busses that may be present In the c mputer 
system 5, including an IEEE 1149.1 standard test bus (JTAG) 81, a grant and Interrupt bus (Gl Bus) 70, an ICIB bus 
65, and an APIC bus 80. As shown In Figure 2, the SMC 200 couples via th se various buss st system management 
r mote(SMR)devlces71,72,73.76t monitor system op rations and to detect syst m faults. The SMC 200 preferably 

s is construct d as an application specific integrated circuit (ASIC). The SMC 200 provides a mechanism for converting 
SMP I/O to PCI configuration cycles. A s cond SMC may be provided on a secondary PCI bus to provide I/O APIC, 
PCI bus temnination, and PCI arbiter capabilities to support the PCI to PCI bridg and additional PCI masters. 

As best seen in Figure 4. the SMC 200 preferably includes a PCI interface 205, a PCI arbiter and logic monit r 
225, interrupt routing logic 235, address translation logic 240, system management registers 260, a local bus controller 

10 270, a system nnanagement processor controller 275, and SM memory controller 280. In addition to monitoring the 
status of the system management remote (SMR) devices, the SMC 200 operates as a bridge between the PCI bus 50 
and the system management local bus 75. The SMC (via the SM local bus controller 270) preferably tailors the cycle 
sp ed based upon the addressed peripheral on the system management bcal bus 75. 

Referring still to Figure 4, the PCI Interface 205 preferably includes PCI master logic 285 and PCI slave logic 290. 

IS Transaction cycles to the system management module 100 that are made oh the PCI bus 50 are processed through 
PCI slave logic 290. Transaction cycles originating from the system management model 100 that are intended for 
targets on the PCI bus 50 (including targets on the EISA bus) are made through the PCI master logic 285. PCI master 
285 and PCI slave 290 couple to the system management arbiter 210 to hitiate requests to obtain mastership of the 
system management local bus 75. Mastership grants are routed from the system management arbiter 210 to the PCI 

20 master 285 and PCI slave 290. The PCI interface 205 also preferably couples to address translation logic 240 and 
local bus path logic 265. 

The PCI slave logic 205 preferably has associated with it a force retry bit which operates under system management 
processor control. Assertbn of the force retry bit by the SMP 150 insures that the system management module 100 
will not be interrupted by PCI bus cycles during SMP accesses to local peripherals such as the VGA controller or the 

2S keyboard controller. The PCI interface 205 preferably has complete access to the entire host system when acting as 
a PCI master. In addition, the PCI interface controller 205 provides the system management processor 150 exclusiv 
access to any system peripheral within the host system. 

In the preferred embodiment, the PCI interface controller 205 preferably has the capability to run configuration 
cycles to configure devices on the PCI bus 50. As one skilled in the art will understand, devk:es on the PCI bus 50 are 

30 configured during configuratkvi cycles by assigning an address range and an interrupt to the device. In accordance 
with the preferred embodiment, the SMC 200 monitors activity oil the PCI bus 50 arKi the SMR 76 monitors activity n 
the EISA bus 60, and determines when a device on these busses fail. The present invention envisions the possibility 
of providing a back-up component on the PCI or EISA bus which is then used as a spare for the failed component. 
Thus, in the event that a system component fails, and a spare devbe is available in the system, the PCI interfac 

35 controller 205 may initiate a configuration cycle to dynamically substitute the failed component with the spare. The PCI 
interface controller 205 preferably Includes configurable memory mapping capabilities to enable PCI menrK>ry and I/O 
addresses to be modified. The system management registers 260 preferably are double mapped to PCI configuration 
space and to programmable EISA specific sk>ts for software access. In the event that a component fails on the PCI 
bus or in an EISA specific skvt, an alternative component that Is memory mapped may be dynamteally switched and 

40 activated by the isystem management central 200. 

One exemplary embodiment for implementing the dynamb switching of spare components Is shown In Figures 6 
and 7. As shown in Figure 7, the SMM 100 connects to the PCI bus 50 through address, data and control lines In 
accordance with normal convention. Three control signals have been shown wh»h are generated by the SMM 100 as 
part of the dynamk: switching capabilities. One skilled in the art will understand that these signals may be encoded 

4S using existing control lines (or combinatk>ns of lines), or may be implemented by adding additional control lines. Thus, 
as shown in Figure 7, the arbiter in the SMC does not give grant to any neater other than SMC to completely lock the 
PCI bus 50 during the reconfiguration cycle. While in the bus kx:k condition all PCI masters are disabled from accessing 
the PCI bus at>sent except for the SMC on behalf of the SMP. The SMM 100 also generates a reset (RESET [n:1]) 
signal that is directed to the failed component 330 via an SMR to disable that device from further operattons. The SMM 

so 100 also generates an initialize spare (INIT SPARE) signal that is directed to the spare component 340 on the bus to 
initialize operatkDn of the spare. As shown in the exemplary embodiment of Figure 7, the SMM 100 includes a memory 
map 360 of system resources. Although the memory map is shown residing in the SMM 100, one skilled In the art will 
understand that the memory map may be physically located at some other bcatlon In the system. The memory map 
360 preferably includes the current address range for each of the system resources, and also Includes an address for 

ss each of the spare components provided in the system, plus an indication of the devices in the system for which the 
spare may be substituted. Th m mory map preferably is configurable to permit address ranges to b changed as 
required during switching and oth r syst m operatk)ns. Thus, if a spare is to b substituted for another d vk: , the 
address range of the spare may be changed t what previously was th address range for th failed devic . The 
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address range for the failed device also is changed, or th failed device is completely disabled to prevent bus contention. 
As. an alternative, each f the system components may be notified of the new address of the spare component. If the 
system components support such capabilities. 

Referring now to Figures 4 and 6, an ex mplary routine for implementing dynamic switching Is sh wn. In step 301 
s theSMC200(pref rablythe PCI arbil randnwnit r logic 226) monitors the system f rfalled components. If ther are 
no component failures, the SMC continues n mnal monit ring and alarm operations In accordance with the principles 
of the present invention. If the SMC discovers that a device has failed In step 303, the SMP next checks (step 305) its 
configuration registers to determine if a spare device is present in the system which can assume the duties of the failed 
component. If no spare is available, the SMP isolates the failed device and cuts the request, grant and interrupt lines 
10 for the failed device (step 307). If the failed device is located on the EISA bus, the SMP initiates a slot specific reset 
in an attempt to reset the failed component. The reset is transmitted to the failed components via the SMC, through 
the SMR. If the device continues to fail and causes the EISA bus to fall, the SMP may disable the PCI-EISA bridge 85 
to prevent transactions between the PCI bus and the EISA bus. 

If a suitable spare component has been provided in the system, the SMR in step 309 goes into the bus lock state 
IS by setting the SMC.Bus.Lock bit in the SMC. In response, the arbltor refuses to grant mastership to other PCI masters 
other than the SMC. The SMP also reconfigures the mennory map, if necessary to associate the spare component with 
an address range. In step 311. the SMP copies the contents of the failed component to the spare device. After the 
copying is complete, the SMP then initializes the spare component, preferably by assigning an interrupt and an address 
range to the spare component. If the spare is given the address range of the failed component, the assignment of th 
20 address range may be delayed until after the failed component is disabled, in step 313 the failed device is disabled, 
and In step 315 the SMC.Bus.Lock bit is deasserted to permit the system to resume norrnal operatkxis. 

Alternatively, the PCI interface controller 205 may reconfigure the memory nr>ap in step 309 to disable the address 
range prevk>usly assigned to the failed component, and nnay then assign a new range to the spare component In th 
memory map. As one skilled in the art will understarKl, the I/O address ranges on the EISA bus are 8k>t specifte. Thus, 
2S to switch in a spare component on the EISA bus, the address range for the failed component must be switched to the 
new address range for subsequent transactions. 

Thus, in the event that a spare dev»e is provided in the computer system as a back-up to system component, th 
SMP via the SMC (and through the SMC) can disable the system component in the event of failure, and can dynamk:ally 
reconfigure the system to enable the spare to assume the functkDns of the disabled component. At the completkxi of 
30 the reconfiguration cycle, the bus lock conditfon is renrwved, and the computer system resumes nonmal operatk>n. 

Referring still to Figure 4. address translaton togic 240 functk)ns to translate address signals on the PCI bus 50 
and the system management kx:al bus 75 to correspond to appropriate bus protocols. The PCI bus 50 preferably is 
32:bit8 wkie and the SM k>cal bus 75 is either 16 bits or 8 bits wide. The address translator 240 takes SM addresses 
and converts these addresses to PCI addresses, and similarly, converts PCI addresses to appropriate SM address 
SS values. A kx:al bus data path module 265 contains all the togic that is used to capture data from the SM bus 75 and 
for multiplexing data from the PCI bus 50 to the SM bus 75. 

As shown in Figure 3, othercontrol signals between the SMP 150 and the SMC 200 preferably Include the READY 
signal and a next address signal NA driven by the SMC 200 to indicate when the SMP 150 can proceed with the next 
cycle; The signal READY is driven low by the processor controller 275 to indicate the end of the current transaction. 
40 In pipelined operattons, the signal NA is asserted tow to Indicate that the SMP 1 50 can provkie the address of the next 
cycle. The output control signals (shown generally in Figure 3 as signals SC) of the 386EX processor also include an 
address status (ADS) signal to indicate a valkJ bus cycle, a dataAcontrol (D/C) signal to indtoate if the current cycle Is 
a data cycle or a control cycle, a memory/IO (M/IO) signal to indicate if the current cycle is a memory or I/O cycle, a 
REFRESH signal to indicate a refresh cycle, a SMM management mode (SMIACT) signal to Indtoate that the processor 
^ is in system management mode, and the signal PWR to indtoate a processor write or read cycle. 

The address translation logic 240 is capable of redirecting system ROM so that the SMC can get the host's CPU 
to use the system management ROM 160 instead of the host system ROM 15. The address translation togto 240 
preferably provides support for memory mapped I/O from the PCI bus 50 to the system management tocal bus 75 and 
from the system management tocal bus to the PCI bus. Thus, the address translation logic 240 permits address and 
space translation of PCI memory cycles to system management module input/output and/or rhemory space. This per- 
mits access to I/O locations on the system management module 100 that are already occupied by other devices in the 
host system I/O space. Similarly, the address translatk>n logto 240 provkies address and space translatton of SMM 
memory cycles to PCI I/O and/or memory space. This permits access to I/O locattons on the host system tfiat are 
already occupied by other devices in the SMM I/O space. The address translation togfc 240 also directs SMM cycles 
^ t either the SM bus or to the PCI bus depending upon the address. 

Ref rring now to Figur s 3 and 4, the system managem nt arbit r 210 s rves as the main arbitration unit for the 
system managem nt local bus 75. Thus, all mast rship r qu sts for the SM local bus 75 ar receiv d and processed 
by the SM arbiter 210. In accordance with th pr ferr d embodiment, the sysl m managem nt tocal bus compris s a 



so 
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modified ISA bus typ bus system. The SM arbiter 210 preferably couples to local bus control 270, processor control 
275, memory control 280. local bus path logic 265. and PCI interface logic 205. The primary masters of the SM local 
bus 75 are the SMP 150 and the SMC 200 (specifically, the PCI slave logic 290) on behalf of another PCI (or EISA) 
master 

5 The SMP 1 50 is the default master on the SM bus 75 after system reset If the SMC 200 d sires to run a cycle on 

the SM bus 75, It asserts a bus h Id request signal HOLD to the SMP 150 to request contr I of the SM bus 75. The 
SMP 150 responds by asserting a signal HLDA to surrender control of the SM bus 75. If the signal HOLD is asserted, 
and the SMP 150 has a refresh cycle pending, the SMP 150 negates the -signal HLDA to request control of the SM 
bus 75. Upon sampling the signal HLDA, and after completion of the SMC cycle, the SMC 200 negates the signal 

10 HOLD to let SMP 1 50 run the refresh cycle. The SMP 1 50 perf omns the ari^itration between SMP and refresh accesses 
to the local bus 75. 

Referring still to Figures 3 and 4, a signal ADFLT for tri-stating the address and data outputs of the SMP 150 is 
provided by the SM arbiter 210 to the SMP 150. As mentioned above, there are two possible masters on the SM local 
bus 75. the SMP 1 50 and a PCI bus master A request from the SMP 150 is indicated by a processor request (PREQ) 

IS signal asserted by the processor controller 275 to the SM arbiter 210. A request from a PCI bus master is indicated 
by a local bus request (ISLREO) signal generated by PCI slave logic 290 in the PCI Interface controller 205 to SM 
arbiter 210. The various cycles executable by the modules in the SMC 200 preferably include a refresh cycle to SM 
DRAM 110, a PCI cycle to the video controller 175. a SMP cycle to SMC registers 260 (which include various menrwry 
and IAD map registers and configuration registers), a PCI cycle to the SMC registers 260, an SMP cycle or a PCI mast r 

20 cycle to the SM memory 110; an SMP cycle or a PCI master cycle to the SM ROM 160, a SMP cycle or a PCI master 
cycle to the SM local bus 75, a SMP cycle or a PCI master cycle to the IEEE 11 49.1 standard test bus controller (JTAG) 
logic 220. and a SMP cycle to the video controller 175. The SMP 150 also provides a processor readAwrite (PWR) 
signal to the SM arbiter 210 Indicating processor read or write cycles. When a PCI master is requesting a cycle to a 
target In the SMM 1 00, the SM arbiter 21 0 asserts a hold (HOLD) signal to the SMP 1 50. which the SMP 1 50 acknowl- 

2S dges by asserting the hold acknowledge (HLDA) signal. Cycles on the SM local bus 75, other than to the SM mem ry 
110, requested by the SMP 150 or a PCI bus master are generated by the local bus controller 270. The local bus 
controller 270 controls accesses to the video controller 1 75 and the SM ROM 1 60. 

If the SM arbiter 210 detects a request for the SM local bus 75. the video controller 175. or the ROM 160, from 
either the SMP 150 or the PCI slave logic 290, the SM arbiter asserts a run local bus cycle (RUNLBCYCLE) signal to 

so the local bus controller 270; In response to assertion of the RUNLBCYCLE signal, the local bus controller 270 generat s 
the appropriate cycles on the SM local bus 75. Once the requested local bus cycle has completed, the local bus 
controller 270 asserts a local bus cycle done (LBCYCLEDONE) signal back to the SM arbiter 21 0 to indteate completion 
of the local bus cycle. When the SM arbiter 210 detects the LBCYCLEDONE signal asserted, it negates the RUNLB- 
CYCLE signal. 

ss If the SM arbiter 210 detects a request for the SM memory 110 from the SMP 150 or from a PCI bus master, it 

asserts a RUNMEMCYCLE signal to the memory controller 280. Memory requests from the SMP 150 include read and 
write cycles as well as memory refresh cycles. As noted earlier, the SMP 150 preferably includes logic to arbitrate 
b tween refresh and other memory cycles. In response to assertion of the signal RUNMEMCYCLE, the memory con- 
troller 280 provkies the necessary control signals to the SM menrKjry 110 to perfomn the requested operatbn. When a 

40 memory operation is completed, the memory controller 280 asserts a memory cycle done (MEMCYCLEDONE) signal 
back to the SM arbiter 210. In response, the SM art^iter 210 negates the signal RUNMEMCYCLE. If the SM arbiter 
210 detects a SMP-to-PCI cycle, it asserts a signal RUNIMSTCYCLE to PCI master togic 285 In the PCI Interface 
controller 205. In response to the assertion of the signal RUNIMSTCYCLE. the PCI bus master 285 obtains control of 
the PCI bus 50 to run the desired cycle. Once the PCI cycle completes, the PCI master 2S5 responds by asserting the 

45 signal IMSTCYCLEDONE. A few ckjck perkxis later, the SM arbiter 210 deasserts the signal RUNIMSTCYCLE. Wh n 
the SM arbiter 210 negates the RUNIMSTCYCLE signal, a signal DJMSTCYCLEDONE in the PCI master logfc 285 
is negated one cksck (PCICLK) perkxl later, which causes the signal IMSTCYCLEDONE to be negated. 

When the PCI or SM local bus cycles have completed, the SM arbiter 21 0 drives a signal PDONE to the processor 
controller 275 to indk:ate the completion of a SMP requested cycle, or asserts a signal ISLDONE to the PCI slave logic 

so 290 to indicate the completion of a PCI bus master requested cycle. 

Local bus control 270 couples to the system management local bus 75 and f unctkxis to control the operatk>n on 
the local bus. In addition, local bus 270 couples to the SM arbiter 210. Processor controller 275 serves as an interfac 
to the system management processor (SMP) 1 50 (Figure 3). Processor controller 275 couples to SM artjiter 21 0, and 
system management local bus. Memory controller 280 controls transactions to the system management DRAM (SM 

ss DRAM) 1 1 0 (Figure 3). The memory controller 280 in accordance with known techniques, provides row address strob 
(RAS), column address strobe (CAS), and addr ss and control signals to the system management DRAM 110 to p r- 
form memory r quests. Memory controller 280 coupl s to command logk: 250 and SM arbit r 210. 

Referring now to Figure 4, th PCI arit>it r and monitor logte 225 couples to th PCI bus 50 and functions as the 
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main PCI arbitration unit f r the PCI bus. Thus, to the extent that other PCI arbiters are present In the computer system, 
those arbiters pr f rably are disabled m favor of th arbit r and monitor logic 225 In the SMC 200. PCI arbiter and 
monitor logic 225 pr ferabty couples t the syst m management registers 260. and includes a cl mency counter 227 
and an v nt counter 229, Counters 227 and 229 may be integrated as part of PCI arbit r and monitor logic 225 as 
will be understood by one skill d in the art. 

The PCI arbiter and monitor logic 225 monitors the PCI bus 50 to detemr>ine if signals appearing on the PCI bus 
ar consistent with PCI protocol. If signals appear out of sequ nee. the PCI arbit r and monitor 225 generates an error 
signal which preferably is transmitted to the system management processor 150 and host processor lb. Preferably, 
the PCI arbiter and monitor 225 also monitors the length of time of certain transactions, including (a) the time before 
FRAME # is asserted after REQ # and GRANT # are asserted on the PCI bus; (b) the time before data is returned or 
provided after the assertion of the FRAME # signal on the PCI bus; and (c) the length of time that the PCI bus is held 
after the REQ # signal is negated. Measurement of the response time preferably is performed by counter 229. Counter 
229 may comprise a plurality of counters for each of the different time out values or a single counter may be used with 
additional processing performed by the PCI arbiter and monitor logic 225. Alternatively, the PCI arbiter and monitor 
225 may include one or more internal counters for performing the time-out functions. 

The PCI arbiter and monitor 225 preferably captures PCI bus master lock commands and addresses upon rror 
occurrence on the PCI bus. In the event of a failure on the PCI bus which causes the PCI bus to 'hang." the PCI arbiter 
and monitor 225 detenmines how far the current cycle has progressed. Information regarding the current bus cycle 
may be stored either in the PCI arbiter and monitor logic 225 or in the system management registers 260. In the event 
of a failure, appropriate registers bits are read back indwating the signals that had been received during the current 
bus transaction. Thus, for example, the PCI arbiter and monitor 225 may monitor the PCI bus for the presence of the 
REQ #. GRANT #, FRAME #, DEVSEL #, IRDY #, TRDY #. and vartous other PCI bus signals generated during th 
current cycle, and In response to detectfon of the PCI signals, may provide an appropriate signal to the system man- 
agement registers 260 to load appropriate register bits. In the event of a failure, the processor controller 277 access s 
the system management registers 260 to read back the appropriate register bits for the current cyde. In addrtfon. th 
PCI arbiter and monitor k^gic 225 also preferably stores a value indicating the component that has mastership of the 
PCI bus, the current address and the cycle type when the PCI bus fails. For parity failures detected by the PCI arbit r 
and monitor 225 or other SMC components, the address or data causing the failure preferably Is latched and stored 
V^^iDJ^® system management registers 260. 

The PCI arbiter and nrwnitor 225 also monitors the operation of components residing on the PCI bus as the main 
PCI arbitratbn unit. In the preferred embodiment, the PCI arbiter and monitor logic 225 includes the capability of gen- 
erating an error signal if several accesses are made to a PCI component without a response. In this vein, and referring 
now to Figure 8. the PCI arbiter and monitor togic 225 tracks the activities of the varfous PCI bus masters. In step 282, 
the PCI arbiter logk: 225 detennines if PCI masters have requested ownership of the PCI bus. If more than one request 
IS received, the artjitration logk: 225 checks the prtorities of the requesting masters (step 284), and in step 286. Issues 
a grant to the master with the highest prkjrity. The PCI arbiter and monitor togic 225 then monitors the PCI bus (step 
288) to determine if the master granted ownership of the bus has started a cycle. If the master does not initiate a cycl 
within a specified time period (as detemfiined by event counter 229, for example), the arbiter logk: 225 checks the count 
in the clemency counter 227 for that master. If the count in the clemency counter exceeds a predetenntned threshoW 
value X (step 292). the arbiter logk: 225 disables the master in step 294 by masking the grant and interrupt lines for 
that master. If the count in the clemency counter 227 is less than x. the clemency counter is incremented for that mast r 
(step 296). After other rraster requests are sewlced in step 298, the faulty master is retried in step 299. 

Thus, if a PCI bus master fails to respond to a mastership grant by the PCI arbiter and monitor 225. its grant is 
removed if there is no other request pending as part of a normal rearbitration. On the first grant removal, a count r 
(such as event counter 229) is activated and the current master number is recorded in an appropriate register in syst m 
management registers 260 or in a latch in the PCI arbiter and monitor logic 225. Subsequent retries to the same master 
cause a counter to be incremented. If the master responds, the clemency counter 227 Is reset If the count fri the 
counter becomes higher than a pre-established maximum value x an error signal is asserted to the system management 
processor 150. In addltfon. the PCI artjiter and monitor togIc 225 disables the ability of that bus master to obtain mas- 
tership of the PCI bus. In the preferred embodiment, a register is provided with bits dedicated to the availability of 
various bus masters on the PCI bus. If a failure is detected for a particular bus master, a bit is set in a dedicated bit 
associated with that bus master device. When a mastership request is received by the PCI arbiter and monitor togic 
225, the error register is examined before mastership is granted by the PCI artjiter and monitor logk; 225. While on 
clemency counter is shown, one skilled In the art will understand that a plurality off clemency counters may be imple- 
^ mented if desired. 

The PCI artjiter and monitor logic 225 also preferably is capable of selecting and storing PCI bus cycles for sub- 
sequent examinatton. The PCI art>iter and monitor logic 225 includes a circular buffer, preferably constructed as a FIFO 
(first in, first out) register. In additton, th PCI arbiter and monitor logk: 225 operates in association with registers in th 
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SMC registers 260, including a bus capture regist r, a holding register and a cycle prbgr iss register. The PCI arbiter 
and monitor logic 225 causes cycles t be stor d in th r gisters including input/output cycl s, memory cycles, con- 
figuration cycles and special cycles. \^lu s in the registers can be read by the system management processor in the 
case of a bus fault on the PCI bus. In addition, th bus capture register and the circular buffer preferably are not clear d 
during SMC r s ts. The bus holding register preferably comprises an Intermediate register located-betwe n th^ bus 
capture regist r and the circular buff r. 

The bus capture register and FIFO status registers are illustrated in Table III: 



TABLE III 



Address: bits 


hunciion | Sub-iuncuon 


xx40:iD-U 


^ro) bus capture address 15-U 


low pan o2 address 




(ro > bus capture aadress J 1 - 1 6 


high pan ol address 


XX44:lD-0 


(ro) bus capture data 


low part o! data 



Address:bus 


hunction 


Sub-runction 


xx46:15-U 


(roj bus capture data i l-l 6 


high pan or data 


xx455:i-0 
xx4S:7-4 

xx48:ll-S 
xx48:15.l2 


(ro) bus capture address C/bbirj:!) 


command 


(rol bus capture Master granted the bus 


UrHost bndfic 

l:EISABridcc 

2:SMC 

3:DEC Bridge 

4:slotO " 

5:slotl 

6:sloi2 

7:sloi3 

8:slot4 

9:slot5 

A-F: rcser\'ed 


(ro) bus capture data CJ/bbfr3:U 


byte enaoies 


(ro| Reserved 




xx4A:U : 

xx4A:l 
xx4A:15-2 


Cr)iraclcer hll-U Status (Not empty) 

(w) Tracker reset (a 'wmt to this register of any 
value results in resening the Tracker FIFO and the 
capture register logic to allow operation to begin) 


1 :Mh U contains data 
OiFIFOempty 


(r) Irackcr MhU iitaius (Overrun) 
read clears FIFO over-run flag, if set 


l:i-ll-U overrun 
0:OK 


trj reserved 




xx4C;:U 
xx4C:l 

xx4C:15-2 


(rp) iracKer ritu LJcnirol/btatus (Not empty) 


1 ititu contains data 
0:FIFO empty 


(roj iracker l-ll-U fJonirol/Status (Uycrrun) it set, 
read clears FIFO over-run flag, else if Not mpty, 
read increments FIFO read pointer 


l:hlhU ovcmin 
0:OK 


(ro) reserved 





The cycle progress registers are used to indicate the progress of the current transaction on the PCI bus. An example 
of the tracker registers is shown in Table IVt 
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TABLE IV 



AddressiDits | h unction 


bub-nincuon 


xx4b:U 
xx4E:l 
xx4E:2 
xx4E:3 
xx4E:4 
xx4E:5 
xx4E:6 
xx4E:7 

xx4E:8 
xx4E:9 
xx4E:10 
xx4E:15-10 


(roj Cycle progress: hKAMh 




iroj Cycle progress: UUAL AUUKtbb 




(ro) Cycle progress: UlLV2>bL 




CroJ Cycle progress: IKUY 




(ro) Cycle progress: IKDY 




{JO) Cycle progress: BUKbl (muitiple data s) 




(ro) Cycle progress: LOCKiii-RAMh 


access to locked aeem 


Iro) Cycle progress: JLUCIUiU 


ILOCktl-KAMk 
followed by LOCK 


(ro ) Cycle progress: Kt I K v 




{JO) Cvcle progress: JL>lbCU:NNhCl 




(ro) Cycle progress: lARGLi ABURi 




(ro) reserved 




xxDU:lD-U 


(ro) IrackerhlhU address 


low pan ot aodress 


xxD2:i>-u 


(ro) Tracker MhU address 31-16 


high part ot address 



20 



xxo4:l^-U 


(ro) tracker MhO data lo-U 


low pan ot r.adress 


xx55:li-U 


(ro) tracker Mi-U data i 1-16 


high part ot address 


xx:>ly:i-0 
xx58:7^ 

xx58:n-8 
xx58:12 

xx58:13 
xx58:14 

xx58:15 


(ro) 1 racker HhU address Ubhs j.U 


command 


(ro) tracker hlhOMaster granied the bus 


U:Hosi brid&e 

1:£ISA Bridge 

2:SMC 

3 :DEC Bridge 

4:sloi0 

5:slotl 

6:slot2 

7:sloi3 

8:sloi4 

9:sloi5 

A-F: reserved 


(ro) I racker M^U data C/bbf? i:0 


Byte enables 


(ro) tracker titu progress: UUAL AUDRLSS 


Indicates that a 64-bit 
address was anempted 


^ro) tracker l-ltO progress: LOCKJbU 


•LUCKifchKAMb 
followed by LOCK 


(ro) tracker 1-tl-U progress: bUlCbt (mtiluple 
daia^s) 


tndicates that there y^-as 
more than one data 
cycle 


(ro) 1 racker h it- O progress: Uli^CUNN tC t 


tndicates ttiat target 
disconnected 



The system management registers 260 preferably fficlude nine address range register sets which can be pro- 
grammed with the address ranges of PCI access cycles. The range registers also are preferably used to specify the 
addresses of targets to monitor by the PCI arbiter and monitor logic 225. In the preferred embodiment, four range 
so registers are provided from memory cycles, four range registers are provided for inputAoutput cycles and one range 
register is provided for configuration cycles. Also, preferably a special enable bit is provided for special cycles. 

In the preferred emt>odiment. the PCI arbiter and monitor logic 225 is capable of determining bus master usage, 
total bus master access latency, total target latency, bus master access latency, the number of retries to a particular 
PCI device, the number of PCI bytes transferred in a particular cycle, the amount of time that the PCI master holds th 
S5 bus after the negation of the REQ signal, and various other perfonmance criteria. 

The system managem nt r gist rs 260 pref rably are acc ssible Ither by I/O r PCI configurati n cycles from 
the PCI bus 50 r by I/O cycles from the system management local bus 75. Th SMC 200 pr ferably is capabt of 
individually disabling any input/output or m mory region from either the PCI sid orth system manag ment side, and 
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can also disable the sourc of any det cted rror. To expand th system management processor addr ss base from 
64 megabytes to four gigabyt s, the SMC 200 translates SMP addresses using s gment regist rs. The SMC 200 is 
Initializ d once th power source (either system power or battery backup 97) is confirmed as acceptabi (batt ry k 
Is asserted). After power up, the SMC 200 determines clock phases and asserts a synchronous reset to the SMP 150 

5 and t the SMR's in the system. 

The PCI arbiter and m nitor logic 225 preferably receives bus mastership requ sts and assigns or grants master- 
ship fthePCIbus50. If a bus rnaster falls, the arbitratbn and n(K>nitor logic 225 deactivates the failed devl^^^ th r 
a master has failed can be determined using various methods. One method whteh can be used to determine whether 
a master has failed Is through software; Thus, a software diagnostic utility can be run periodically on the system to 

10 monitor the response of each device on the bus. If the arbiter and monitor bgic 225 determines that a device has failed, 
it can Ignore all mastership requests from that device, and refuse to grant mastership to the failed device. In additbn, 
th SMC 200 can mask the interrupt requests from the failed master through the Interrupt routing Implementation shown 
in more detail In Figure 5. The SMC 200 receives all of the Interrupts in the system through vark>u8 busses and passes 
th interrupts to the SMP 1 50. 

IS A number of SMR units are dispersed through the system to provide additional monitoring capabilities for the SMM 

100. The SMR's communicate with the SMC 200 via high speed bi-directional serial busses (either the Gl bus or the 
ICIB bus) and the IEEE 1149.1 standard test bus. Time division multiplexing Is used on the Gl bus or the ICIB bus, 
with the SMC driving high, and the SMR's driving low. The devices on the serial bus are synchronized to a particular 
counter. 

20 In the preferred embodiment, each of the SMR devices share a common architecture. Thus, in Figure 10 an ex- 

mplary SMR unit 700 is shown with the understanding that the configuration of. the SMR devices (for example, SMR 
units 71, 72, 73, 76) preferably implements the same configuration as SMR 700. The SMR 700 preferably includes a 
JTAG slave logic 705, an IIC logic 71 0, a general purpose input/output logic 715, interrupt bus logk: 720 and bus monitor 
logic 725. The SMR 700 preferably communicates with the SMC 200 (Figure 3) through the JTAG bgic 705 and IIC 

2S logic 710. The SMR 700 communicates with other peripheral devices through the IIC logic 710. 

The lie logic 710 preferably comprises a two-pin interface that connects to various slave peripherals, including 
non-volatile RAM's, temperature sensors, and analog-to-dlgital converters. The two pins controlled by the IIC logic 710 
are a serial clock interface (SCL) and a serial data interface (SDA). These two pins make up the IIC bus, whk^h imple- 
ments a protocol developed by Philips Sembonductor, Inc. The SCL and SDA pins are bi-d|rectk>nal, and the SMR 

^ both drives and samples these pins at the same time. The SMR 700 can operate as either a master or slave device 
on the IIC bus. On the IIC bus, the master is the devk;e that initiates a start conditk>n by changing the state of the SDA 
from high to low while SCL is high. The master maintains responsibility, for controlling the SCL line throughout the 
transaction. The slave either sends data to or receives data from the master. After a cycle is initiated, the master 
transmits a 7 bit address and a write/read bit. The receiver then transmits an acknowledge pulse. Ail subsequ nt 

55 transmissk>ns are 8 bit data pulses followed by an acknowledge pulse. The transaction is concluded by generatkxi of 
a low to high transitk>n on SDA while SCL is high by the master. 

After each byte is transferred across the IIC bus, the IIC logk: 710 issues an interrupt to the SMC 200 through the 
I BUS k>gic 720. In accordance with the preferred embodiment of Figures 2 and 3, ttie |BUS comprises either the Gl 
bus 75 or ICIB bus 60. The Interrupt on the Gl bus 75 or ICIB bus 60 is ia request for more data If the SMR is transmitting 

40 or a request to clear a data buffer if the SMR is receiving. The I IC logic 71 0 also issues an Interrupt when it is addressed 
by another master starting a transactk>n. In the preferred embodiment, the IIC logk: 710 includes four registers, an 
address register 712, an IIC clock register 71 4, a data shift register 716, and a control/status register 71 8. The address 
register 712 contains a 7 bit address whk:h the SMR transmits when addressed as a slave. The ck)ck register 714 
stores the ckx:k frequency implemented on the IIC biis. This frequency is programmed into the ckx:k register during 

4S initialization. The data shift register 716 performs all seriaMo-parallel interfacing with the IIC bus. The controUstatus 
r gister 718 contains the IIC bus status information required for bus access and monitoring. 

Referring now to Figures 2, 3 and 10, the bus monitor logic 725 preferably monitors a peripheral or system bus 
for usage and errors. In the preferred embodiment, thirty-one input/output pins are used to monitor the peripheral bus. 
The pins are configurable to permit the bus monitor logic 725 to monitor varbus busses, such as the host bus 35 r 

so EISA bus 60 (Figure 2). Thus, the bus monitor logic 725 is configured into a particular mode of operation during ini- 
tialization. In the preferred embodiment, bits 4 and 5 of the SMR configuration register 727 (address 1 2h) are dedicated 
t defining the mode of operation for the bus monitor logk: 725. Available nriodes include a first and second host mode 
(h St mode 1 and host mode 2) and an EISA mode. Thus, if configured for the EISA mode, the bus monitor logic 725 
checks the EISA bus for vark>us types of errors, and also collects statistics showing EISA bus usage. The bus monitor 

55 logic 725 preferably contains a RESDRV input pin that connects to the RESDRV of the EISA controller. The SMR bus 
monitor togic 725 preferably includes several output pins that are used to assert RESDRV t each slot of th EISA 
bus. The bus monitor logic 725 also preferably rec ives the EISA MAK # signals. 

As will be understood by one skilled in the art, the EISA controller asserts RESDRV as a response to an EISA bus 
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error. The bus m nrt r logic 725 intercepts this signal and generates an interrupt to the SMP 150 (Figur 3) via the 
IBUS (which f r th EISA SMR 76 is the ICIB bus). The SMR signals this Int rrupt to th SMC 200 (Figure 3) by 
asserting an interrupt on slot 2 of the ICIB bus. The SMC 200 then pass s this interrupt to the SMR Aft r receiving 
the RESDRV int rrupt. the SMP 150 determines the MAK # signals on the EISA bus by reading th status of a MAK 

s # latch regist r in bus monit r logic 725. Aft r determining which mast r had control of the bus wh n RESDRV was 
asserted, the SMP 150 causes a slot-specific RESDRV to be sent to the offending master. 

R f erring again t Figure 10, the SMR bus monit r logic 725 preferably includes counters (not shown specifically) 
to track the EISA bus total bus utilization and also the bus utilization due to EISA bus master activity for each stot. In 
th preferred embodiment, the bus monitor togic 725 includes inputs lines for coupling to the following EISA signal 

10 lines: CMD#, REFRESH. NMFLUSH. EISAHLDA. START#. EMAK1, EMAK2, EMAK3, MRQ#[0:51 and BCLK. Total 
bus utilization is developed by summing the START#, CMD#. and EISAHLDA signals, after multiplying each of the time 
that these signals are asserted by BCLK to make transparent the effect of stretching BCLK. Thus, total bus utilization 
= BCLK(START# asserted) + BCLK(CMD# asserted) + BCLK(EISAHLDA asserted). Bus utilizatk)n due to EISA bus 
master activity is based upon the MRQ# and MAK# inputs as foltows: 

IS 

Busnnaster util = IREFRESH & [BCLK(MAK# k>w) + BCLK(MRQ# low after MAK# high)] 

Each bus master slot has a separate counter. The utilization for each slot is found by dividing the count for that stot by 
20 the count of the free counter. 

The bus monitor logk; 725 preferably determines two types of EISA bus violattons. CMD* time-out and Refresh 
hold-off. A time-out violation occurs if the bus monitor logic 725 determines that any devtoe on the bus has asserted 
CMD# for more than 256 BCLK's. If a time-out occurs, the SMR signals the SMC by asserting an interrupt on the ICIB 
bus. Similarly, the SMR generates a refresh hold-off vtolatton if the REFRESH signal is not asserted for 256 BCLK's. 
2S In somewhat similar fashkxi the bus monitor logic 725 also is capable of monitoring the host bus to detect bus utilization. 
When In the EISA mode, the SMR 700 asserts interrupt requests on the EISA bus as requested by the SMC. Th 
SMC identifies the interrupt requests to assert and transmits those interrupt requests to the SMR across the IBUS. 
The SMR asserts the interrupt request until the SMC transmits a stop signal to the SMR. Thus, whatever the SMC 
drives to the SMR on the IBUS at a particular slot corresponds to what the SMR will drive on the EISA bus for that 
^ interrupt request level. 

Referring again to Figure 10, the general purpose inputAoutput logk: 715 is used to assert, de-assert, tri-state. or 
ch ck the value of a bit. The general purpose I/O logk: 715 includes registers (not shown specifically) that control or 
check the level of vartous output pins. When the EISA mode is selected in the SMR configuratton register 727. the 
general purpose I/O logic 71 5 changes its functionality, and all inputs preferably become inputs for the interrupt requests 
35 on the EISA bus. These inputs then are used to pass interrupt requests to the SMC via the IBUS. Stots are reserv d 
In the IBUS for each input. When In the EISA mode, the inputs are routed through a series of latches which filter glitch s 
on the EISA interrupt lines. The inputs are sampled on the rising edge of BCLK and latched. An interrupt which is 
asserted for greater than the period of the glitch filter but then goes away is latched inside the SMR and transmitt d 
to the SMC. An interrupt whrch is less than the period of the glitch filter is ignored. An interrupt whk:h is asserted for a 
^ period longer than the cycle time for the slots on the IBUS is not transmitted to the SMC again. 

The interrupt bus interface togic 720 connects the SMR 700 to a one pin specialized bus. which preferably Is either 
the GI bus or the ICIB bus (referred to generically in Figure 10 as the IBUS). This bus is used by the SMR 700 to send 
an asynchronous signal to the SMC 200 (Figure 3). Upon receipt of the signal, the SMC 200 sen^ices the SMR 700. 
The interrupt bus functions as a 16 state counter, with each SMR al towed to interrupt only during one of the 16 states. 
45 When operating in the EISA mode, the IBUS interface 720 becomes bi-directional. The SMR then senses the state 

of the IBUS when not driving. When the IBUS is driven low, the corresponding EISA interrupt request is asserted. The 
int n-upt requests preferably are asserted through the general purpose block logic 715. The IBUS logic includes an 
8-bit register 722 which kientifies the source of the interrupt. Another 8-bit register 724 stores the interrupt address in 
the first four bits. The interrupt address causes the interrupt to be asserted in the slot corresponding to the hexadecimal 
so value in the first four bits of register 724. 

The JTAG slave togic 705 is based upon the Compaq JTAG slave standard cell design; however, the standard cell 
d sign is modified to permit the JTAG lines to be used for communication to all registers within the SMR. The SMR is 
c nnected to the SMC and the other SMR's via the IEEE 1149.1 compatible standard test bus (JTAG bus [4:0]) set up 
in a ring configuration. The slave togic 705 includes a JTAG multiplexer (not shown specifically) that expands the JTAG 
ss chain, allowing the JTAG slave to select any of six JTAG chains with whbh the SMR interfaces. 

Upon being pow red up, the IIC address register 712 and clock register 714 must b initialized. The SMR initial- 
ization state machine controls the IIC control togic 710 to read in initial values for thes regist rs. The SMR pr f rably 
attaches to a non-volatile EEPROM via the IIC bus. which stor s the initialization values. One initialized, the SMR 
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connects to varbus sensors through an analog-to-dtgital converter(s) for monitoring temperature, v Itage lev (s, n is 
in various compon nts, system bus utilization, and other functions. The SMR also perfomris c rtain miscellaneous 
functions such as fan spe d monitoring, rror display control for the front panel of th computer system, power man- 
agement for en rgy consen^ati n. tamper reporting, and serial number management. The infomnation in the EEPROM 

s configures th SMR's regist rs, counters and timers, and also preferably st res historical information regarding th 
SMR and the parameters monitored by th SMR. 

Thus, in summary, if a failure occurs on the EISA bus, the EISA monitoring SMR can reset the EISA bus, r can 
. r set slots on the EISA bus. If problems continue oh the EISA bus, the PCI-EISA bridge can be disabled by the PCI 
arbiter and nx>nitor logic 225. The error is then reported via the SM modem or SM NIC to a remote site. 

10 Referring again to Figure 4, the PCI arbitration unit 225 preferably is capable of assigning priority based upon the 

master target combinations. Most arbitration schemes are based either on a round robin approach with no prioriti s, 
or have priorities assigned to each master. In the preferred embodiment of the present invention, the master4arget 
combination is used to determine priorities. In accordance with the preferred embodiment, the SMC registers 260 
preferably includes priority register files identifying high priority master target pairs. Thus, four registers maybe provid d 

IS to identify high prbrity master-target combinations, as shown in Table V: 



TABLE V 



20 



PRIORITY REGISTER # 


MASTER 


TARGET 


1 


AAAAAAAA 


WVWWA/VWWW 


2 


BBBBBBBBB 


XXXXXXXXXX 


3 


cccccccc 


YYYYYYYYYY 


4 


DDDDDDD 


/////////// 



Thus, as shown in the flow chart of Figure 9, the arbitration and monitor logic 225 determines in step 401 whether 
a mastership request appears on the PCI bus. If a rhastership riequest does appear, the arbitration logic 225 awards 
mastership in step 403 based upon either a round robin approach, or based upon priorities assigned to specific masters, 
in accordance with conventional techniques. After mastership is granted, the arbitration and monitor logic 225 deter- 
mines the target of the current cycle in step 405. In step 407, the arbitration and monitor logic 225 checlcs the curr nt 
master-target combination to detemrilne if the combination has been determined to be a high priority combination, as 
specified in the high priority registers (see /or esxamp/e TABLE V). 

If the arbitration logic determines in step 409 that the current master-target combination is not a high priority com- 
bination, the arbitration logic returns to a normal mode of operation. Conversely, if the present master-target is a high 
priority corhbinatloh^ the arbitration and monitor logic 225 determines In step 411 if the current master seeks another 
mastership cycle, if so, the sirbitration logic 225 awards mastership to the current master in step 413. The arbitration 
logic then loops to step 405 to once again determine the target of the current cycle. 

Thus, after granting mastership of the PCI bus, the arbitration and monitor logic 225 preferably monitors the target 
address. Once the target address is determined for the current transaction, the arbitration and monitor logic 225 com- 
pares the master and target address with the combinations listed in the priority registers (lable V). If a match is found, 
that master is elevated to the highest priority as long as that master continues to request mastership of the bus. One 
skilled in the art will understand that more or less than four registers may be provided to specify greater or fewer thari 
f ur high priority master-target combinations. In addition, one register may be used to encode more than one master- 
target pair depending upon the register size and the numk>er of bits required to specify the address of the master and 
target Further, although the present invention is discussed by referencing the PCI bus, one skilled In the art will un- 
d rstand that the present invent bn can be used on other bus systems as well which have bus mastership capabiliti s. 

A conventional round robin arbitratk>n scheme is shown in Figure 11 A. Six masters (Ml , M2, M3, M4, M5, MB) ar 
d picted for purposes of illustratkxi. In conventional round robin iogk:, the arbitratton unit sequentially rotates to each 
master to took for a grant request. In the absence of a grant request, the next master is checked for a grant request. 

In the hot master/target scheme shown in Figure 11 B, the M2 master has been detemnlned to be addressing a 
target that has been predetermined to be a high priority masterAarget pair (such as, for example, a CPU master and 
a video controller slave). Thus, the arbitration unit retums to the high priority M2 Master after completing a mastership 
cycle with another master. Thus, as shown in Figure 11 B, after a Ml Master cycle, M2 is granted mastership. The n xt 
master cycle is granted to M3, with M2 again given a grant in the next master cycle. In this fashk>n, the high priority 
master is given every other mastership cycle which addressing a high priority target. 

Referring still to Figure 4, interrupt routing logic 235 receives interrupt requests from varbus components in the 
system. Thus, interrupt routing logic 285 receives int rrupt requests originating on the PCI bus 50. interrupt requ sts 
originating on th EISA bus, PCMCIA interrupt requests, mouse and keyboard interrupt requests, and other requests 
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made on r transmitted via th APIC bus. In addition. Int rrupt routing logic 235 preferably receives signals from the 
SMR'8 71,72. 73(Figur 2) via the Gl bus 70. Int rrupt logic 235 also rec ives a signal from the SMR 76 via th ICIB 
bus 65. Both the Gl bus 70 and the ICIB bus 65 preferably are bi-directional serial buss s which use time division 
multiplexing to schedule transmissions. If the source of an interrupt cann t be det rmined, the IEEE 1149.1 standard 
5 test bus Is us dt assist in locating the source of interrupts. As sh wn on Figur 4. signals on the APIC bus are routed 
thr ugh APIC I/O 215. An IEEE 1149.1 compatible bus contr Her 220 functions to couple the SMC 200 to the IEEE 
1 1 49. 1 standard test bus. The IE EE 1 1 49. 1 c mpatible bus controller 220 coupl s to command logic 250. In accoidance 
with the preferred embodiment, command logic 250 initiates cycles on the IEEE 1149.1 standard test bus 81 to assist 
in locating the source of Interrupts. In addition, other control signals may be transferred through the IEEE 1149.1 

10 standard test bus 81. Thus, for example, in instances where the PCI bus is inoperable, the SMC 200 still may perform 
cycles to other system components via the IEEE 1149.1 standard test bus. Thus, in a failed system, the SMC 200 can 
interrogate other system components through test vectors stored in EEPROM, which are generated using Texas In- 
strument Asset tools and Victory software. The IEEE 1149.1 compatible bus controller 220 preferably Includes both a 
master block and a stave block to permit two SMC units to communicate if more than one SMC unit is provided in the 

ts system.EachblockhasitsownsetofJTAGpinssothatthelEEE 1149.1 standardtest bus 81 may be used to configure 
and control the SMR*s, and to write to other devices on an tIC bus (or other bus) via a JTAG-to-IIC bridge in the SMR. 
The SMC IEEE 1 1 49. 1 standard test controller 220 is accessible by both the SMP 1 50 and the host CPU 1 0. 

Referring now to Figures 4 and 5. the SMC 200 collects remote interrupt requests for the SMP 150 and the host 
CPU 10 to read. Several SMP state machines detect and report abnormal conditions by setting error bits. Once the 

20 SMC detects an error conditk>n. it interrupts the SMP. If a catastrophic failure is detected, the SMC asserts a syst m 
error (SERR) signal whk:h is connected to the EISA controller ESC to produce an NMI signal. The SMC stores the first 
error detected in a first error register and all detected errors in error register to altow a more intelligent failure analysis 
The interrupt routing scheme according to the preferred embodiment is shown in Figure 5. All interrupts from the 
SMM components (such as PCMCIA adapters 1 94, 1 95, SMP 1 50. keyboard and mouse controller 1 85. and SMR 325) 

2S are routed through SMC 200. The SMC 200 also receives interrupt requests from SMR's 71 , 72, 73 on Gl bus 70, and 
from SMR 76 on ICIB bus 65. The SMC 200 also receives EISA intenrupt signals on the APIC bus 80. Intemipts are 
r uted from the SMC 200 to SMP 150 on APIC bus 80. In one embodiment of the invention, a second SMC 300 is 
provided on a secondary PCI bus 350. The second SMC 300 couples to the APIC bus 80 and ICIB bus 65 to assist in 
processing interrupt requests from the secondary PCI bus. 

^ By routing ail interrupt requests through the system management central (SMC), the SMC 200 is capable of mask- 

ing interrupt signals from disabled PCI and EISA components. As the interrupt control devk:e for the system, the SMC 
200 also can reassign interrupts to devk:es in the system, including hot spare components in one of the embodiments 
of the present inventk)n. In addition, the SMC facilitates the operatkxi of the SMM 100 in a virtual tenminal mtode 
because all interrupts come through the SMM. 

35 The Gl bus and the ICIB bus preferably are busses connecting the SMR's and the SMC's. Preferably, the SMR 

has common pins for the Gl and ICIB busses. Thus, the SMR is configured for operating in either mode. The Gl bus 
and ICIB busses only require four lines, three of which already are present in the system (Vcc, ground, and the JTAG 
clock). The fourth line is a bi-dtrectionai data line. Referring now to Figures 2 and 5, Gl bus 70 couples SMR 71, SMR 
72 and SMR 73 to the system management module 100. The ICIB bus 65 couples the SMR 76 located on the EISA 

^ bus to the system management module 100. The Gl bus 70 and the ICIB bus 65 permit bi^lirectkjnal transmission of 
interrupts from different sources with the SMM 100 being able to infer information about the interrupt by knowing the 
time during which the ffiterrupt was asserted. The Gl bus and ICIB bus use time divisk>n multiplexing schemes to saf ly 
alk>w multiple devk:es to assert their interrupts on the same line for the SMC to sample. The SMC may also drive the 
same line. Thus, the Gl bus and the ICIB bus are used to pass interrupt signals from remote devk;es to the SMM 100. 

^ Interrupts on the EISA bus may also be transmitted to the SMM via the ICIB bus 65. Each of the various interrupt 
transmlssk>n routes preferably are individually programmable and maskable by the SMM. 

In the preferred embodiment of Figure 4, the system management processor 1 50 dynamically ascertains the JTAG 
topology as part of system initialization. In accordance wrth known techniques described in the IEEE 1149.1 standard 
test access port description, software running on the SMP 1 50 performs an ID scan to determine what chips and what 

so JTAG topology are present in the system. After determining that the system contains one or more SMR's, which serve 
as JTAG bridges, the software can use the SMR's to connect to other JTAG chains and perform an ID scan to detenrntne 
the topology of this chain. As shown in Figure 4, the SMC 200 preferably includes a IEEE 11 49. 1 standard test controller 
220. The 200 altows the SMP 150 to run a boundary scan of all scanable components in the computer system. The 
IEEE 1149.1 compatible bus controller 220 includes a response FIFO set which allows the SMP 150 to pipeline several 

^ commands to the JTAG port. The IEEE 1149.1 compatible bus controller 220 is capable of reporting the various con- 
ditkms of the computer system to the perator via the computer scr en display or through the modem r local area 
network connected through th PCMCIA sk)ts 1 95, 1 96. 

The IEEE 1149.1 compatibi bus controller 220 interfaces the SMC 200 to th IEEE 1149.1 standard t st bus 81. 
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Each of the SMR's 71. 72, 73, 76 preferably includes a regist r to control multiplexing in the SMR. The regist rpr f rably 
IS included in the JTAG data chain to p rmit modification of the state of the multiplexer. A JTAG roaster device permits 
the IEEE 1149.1 standard t st bus to be us d f r testing and management communications In accordance with the 
preferred embodiment. Thus, the IEEE 1149.1 standard test bus Is used to perform serial communications between 

s the SMR and SMC as well as function as a t st in accordance with normal techniques. In the pr ferred embodiment, 
PCI masters can scan the SMC and thus the SMP via the IEEE 1149.1 standard test bus. Thus, for example, the host 
CPU can scan the syst m management module via th IEEE 1149.1 standard test bus if d sired. 

The system management remote (SMR) devices 71 . 72, 73 connect to the SMC 200 for gathering interrupt infor- 
mation from remote sources, such as various processor, memory, and I/O cards. The SMR's are implemented with 

10 relatively small ASIC's to minimize cost. Thus, one SMC can be used with many SMRs to allow the SMM 1 00 to nrK>nitor 
the rest of the computer system. One SMR preferably is physically located in the SMM 100, and the other SMR's are 
located in various renrK>te modules, interrupts provided by SMR's are gathered using time division multiplexing by the 
interrupt controller 235 In the SMC 200 over a bus GIBUS 70. The SMR 76 is located in the expansion subsystem. 
The SMR 76 receives the EISA IRQ signals and passes the interrupts via the bi-directional serial link bus ICIB 65 to 

IS the interrupt controller 235. The function of the SMR's could also include filtering the interrupts to remove positive 
glitches and stretching negative pulsing interrupts. Once the SMC 200 receives an Interrupt from one of the SMRs, it 
performs a determination of the source of the interrupt. If the source of an interrupt is unknown due to slot sharing, the 
SMP 150 scans the SMR's registers via the IEEE 1149.1 compatible bus controller 220 to determine the interrupt source. 
Referring still to Figure 4, APIC IAD 215 functions to interface the SMC 200 to the APIC bus 80. in the preferred 

20 embodiment, the SMC 200 implements a remote read operation used in the I/O APIC which permits the SMC 200 to 
read the host processor's internal states as part of a diagnostic feature. The I/O APIC logic 21 5 also preferably supports 
interrupt routing from the SMC to the host on the APIC bus 80 in the event that the host CPU does not support APIC 
protocol. 

In the preferred embodiment, the SMC survives a system power down and also is capable of surviving a PCI clock 

2S failure. The SMC 200 preferably reports other ckx:k failures such as an APIC clock failure. The SMP monitors temper- 
ature and fan speed and can identify blocked or failed fans and even predk:t eminent over temperature or eminent fan 
failure. Temperature preferably is read from a temperature sensor attached to the SMR via the Philip's IIC bus: SMR 
76 preferably couples to the SMM via IEEE bus. Thus, the SMR 76 can transmit to the system management modul 
100 Infonmatkxi regarding asset management, failure modes, run time k>gs, diagnostic or operating systems which 

30 flagged any failure, and other informat»n. ^ 

In the preferred embodiment, the SMC 200 includes two individual registers for storing error values. One regist r 
latches the first occurrence of an error and the second register latches all occurrences of errors. The SMC stores th 
first error detected in the first error register and all detected errors in the error register to alte>w a nrK>re intelligent failure 
analysis. The SMC 200 also latches and reports PCI address, data parity errors, SERR#, clock failures, and all failures 

35 can be nnasked. The SMC also supports reporting via the interrupt or other signal lines to the host or system manage- 
ment module. The SMC also includes self error register bits as well as EISA and system hang logic. 

The foregoing disclos;ure and description of the invention are illustrative and explanatory thereof, and various 
changes in the size, shape, materials, components, circuit elements, wiring connections and contacts, as well as in 
th details of the illustrated circuitry and constructton and method of operation may be made without departing from 

40 th spirit of the inventk>n. 

Numerous variations and nrKxJifications will become apparent to those skilled in the art once the above disck>sur 
is fully appreciated. It Is Intended that the foltowing claims be interpreted to embrace all such variatkxis and modifica- 
tkxts. 

45 

Claims 

1. A computer system, comprising: 

so a host processor connected to a host bus; 

a bus bridge connecting said host bus to a system expansion bus; 

a system management module connected to saki system expansion bus, said system management module 
including: 

ss a system management processor connected to a system management local bus; 

a system management central control unit connected to said system management local bus and to the 
expansion system bus; 

wherein said system management c ntral control unit includes an arbiter and monitor togic which operates 
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as a bus arbiter for the expansion bus, and interrupt routing logic for selectively routing interrupt r quests 
t the host processor originating from connponents on the expansion bus. 

2. A system as in claim 1, wherein the syst m xpansion bus comprises a PCI bus, and the syst m management 
s central control unit perat sasth busarbit r for the PCI bus. 

3. A system as in claim 2, further comprising a second system expansion bus coupled t said PCI bus by a second 
bus bridge. 

10 4. A system as In claim 3, wherein the second expansion bus comprises an EISA bus. 

5, A system as in claim 3. wherein the second bus bridge couples to said system management central control unit 
and to said host processor through an interrupt bus. 

IS 6. A system as in claim 6, wherein the interrupt bus comprises an APIC bus. 

7. A system as in claim 5. wherein the second bus bridge functions to relay intermpt requests from devices connected 
to the second system expansion bus. 

^ a A system as in claim 7, further comprising a system management remote unit connected to said second system 
expansion bus, said system management remote unit connecting to said system nnanagement central control unit 
by a serial bus. 

9. A system as in claim 8. wherein the system management remote unit provides a signal to said system management 
central control unit in the event that a device connected to the second system expansion bus fails. 

10. A system as in claim 2, wherein the system management central control unit receives requests for mastership 
ofthe PCI bus from PCI master devices connected to the PCI bus. 

30 11. A system as In claim 10, wherein the arbiter and monitor logic receives the requests for mastership and grants 
mastership of the PCI bus to requesting masters. 

12. A system as in claim 11, wherein the interrupt routing togic receives interrupt requests from devices connected to 
the PCI bus. 

3S 

1 3. A system as in claim 1 2, wherein the art^iter and monitor logic monitors the PCI bus to determfrte if any compon nts 
connected to the PCI bus have failed. 

14. A system as in claim 4, further comprising a plurality of system management remote units coupled to said syst m 
^ management central control unit 

15. A system as in claim 14, wherein said system management remote units couple to said system rnanagem nt 
central control unit by a standard test bus. 

45 16. A system as in claim 15, wherein said system management renxste units also couple to said system managem nt 
central control unit by a bi-directional bus. 

17. A system as in claim 15, wherein at least one of the system management remote units also couple to the system 
management central control unit by an ICIB bus. 

ia A system as in claim 17, wherein at least one of the system management remote units also couple to the system 
management central control unit by a Gl bus. 

19. A system as in claim 18, wherein said system management central control unit includes common terminals for 
connecting to both the ICIB bus and the Gl bus. 

20. A system as in claim 14, wherein at least one of the system management r mote units couple to an IIC bus. 



so 



ss 
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21. A system as in claim 14, wherein one of said plurality of system management r rnote units also couples t th 
EISA bus. 

22. Asyst masinclaim14, wher inoneof said plurality of system management remote units also couples to th bus 
s bridge. 

23. A system as in claim 14, wherein on of said plurality of system management r mote units also couples to th 
second bus bridge. 

to 24. A system as in claim 14, wherein one of said plurality of system management remote units also couples to the 
host bus. 

25. A computer system, comprising: 

15 a host processor connected to a host bus; 

a bus bridge connecting said host bus to a PCI bus; 

a system management module connected to said PCI bus, said system management module including a 
system management central control unit connected to said PCI bus; 

wherein said system management central control unit includes an arbiter and monitor logic which operates as 
20 a PCI bus arbitration unit, and said arbiter and monitor bgic receives requests for mastership of the PCI bus 

from PCI master devices connected to the PCI bus, and said arisiter and monitor logicmonltors the PCI bus 
to determine if any components connected to the PCI bus have failed. 

26. A computer system, comprising: 

2S 

a host processor connected to a host bus; 

a bus bridge connecting said host bus to a PCI bus; 

a system management module connected to said PCI bus. said system management modulie including a 
system management central control unit connected to said PCI bus; 
30 : wherein said system management central control unit includes monitor logic to monitor the PCI bus to deter- 

mine if any components connected to the PCI bus have failed, and intermpt routing logic for selectively routing 
interrupt requests to the host processor originating from components on the PCI bus. 

27. A computer system, comprising: 

3S 

a host processor connected to a host bus; 

a bus bridge connecting said host bus to a PCI bus; 

a second bus bridge connected to said PCI bus and to a system expansion bus; 

a system management module connected to said PCI bus, said system management module including a 
40 system management central control unit connected to said PCI bus; 

said system management central control unit including interrupt routing logic for selectively routing interrupt 
requests to the host processor; 

a system management remote unit connected to said system expansion bus, said system management remote 
unit connecting to said viterrupt routing logic by a serial bus; 
45 and said second bus bridge connects to said interrupt logic by an interrupt bus, and said second bus bridge 

relays interrupt requests from devices connected lO the system expansion bus. 

28. A system as in claim 12 or claim 25. wherein the arbiter and monitor logic ignores requests for nnastership from 
failed devices on the PCI bus. 

so 

29. A system as in claim 28, wherein the system management central control unit also includes interrupt routing logic 
for receiving interrupt requests from devices connected to the PCI bus. 

30. A system as in claim 14. claim 26 or claim 29, wherein the system management central control unit masks th 
55 interrupts from failed devices on the PCI bus. 

31. A system as in claim 30, wher in one of the devic s requesting nr^stership of th PCI bus is a PCI to EISA bus 
bridge, and the PCI-to-EISA bus bridge interfaces the PCI bus to an EISA bus. 
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32. A system as in claim 31 , wh rein the PCI-to-EISA bus bridg connects to said Interrupt routing logic through an 
interrupt bus to route all Int rrupts from the EISA bus to the syst m managem nt central control unit. 

33. A system as in claim 32, further comprising a system management r mote unit connected to said EISA bus, said 
5 second managem nt r mote unit conn cting to said syst m management central c ntrol unit by a serial bus. 

34. A system as in claim 33, wh rein the serial bus comprises a high spe d bi-directional inter-chip interrupt bus. 

35. A system as in claim 34, wherein the system management remote unit provides a signal to said system manage- 
10 ment central control unit in the event that a device connected to the EISA bus fails. 

36. A system as in claim 25 or claim 30, further comprising a system management processor coupled to said system 
management central control unit through a system management local bus. 

IS 37. A system as in claim 27, wherein the system expansion bus comprises an EISA bus, and the second bus bridge 
comprises a PCI-to-EISA bridge. 

38. A system as In claim 27, wherein the system management remote unit provides a signal to said system manage- 
ment central control unit in the event that a device connected to the system expansion bus fails. 

20 

39. A system as in claim 9, claim 35 or claim 38, wherein the interrupt routing logic Is ciapable of masking interrupt 
requests relayed by the second bus bridge in the event that the device requesting an Interrupt has failed. 
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